Hi everyone,
I am working on a webscraping project in which I have already scraped the pdfs from the link https://www.extremenetworks.com/support/end-of-sale-and-end-of-support-products/ (table 1). I am now working on extracting the tabulated data from these pdfs. When using the pdf reader tool the some tables are aligned and others are have very little alignment, as seen below compared to the data in the pdf: (I also attached the workflow, pdf and xlsx file) I would like to know if anyone knows how I can align the data. Does anyone have some ideas or have solved a problem like this before? Wanted to use the extract the pdfs as images, but data is missing from the tables if I use these computer vision tools.


Thank you for helping!
Rouche