The new Text Mining tools mentioned above work great if you have a file that you want to pull very specific pieces of information from, such as data from an invoice or a form. They are an additional package that you'd need to purchase, but they are a good fit for these situations.
If you need to pull data from large tables in a PDF that span multiple pages, I suggest using the older PDF tool. It can be found here:
@Graceyahiro - I am on an older version of Alteryx, and the Intelligence Suite in 2020.2 comes at a cost. I use the R library pdftools or Tesseract to parse my PDFs. When I have multiple PDF files, I run them through a macro I created to do the job for me. Below is an article that you may find interesting; it has some further links to read:
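For anyone who wants to try the multi-file approach outside Alteryx, here is a rough sketch of the idea in Python. It is not the macro described in this reply, and it swaps the R pdftools library for the third-party pypdf package, which is an assumption on my part. The function names `extract_with_pypdf` and `parse_pdf_folder` are made up for illustration. Scanned PDFs with no text layer would still need an OCR step such as Tesseract, which this sketch does not cover.

```python
from pathlib import Path
from typing import Callable, Dict


def extract_with_pypdf(path: Path) -> str:
    """Return the text layer of one PDF, using pypdf (assumed to be installed)."""
    # Imported lazily so the batch helper below can be used with another extractor.
    from pypdf import PdfReader

    reader = PdfReader(str(path))
    # extract_text() can return an empty result for image-only pages.
    return "\n".join((page.extract_text() or "") for page in reader.pages)


def parse_pdf_folder(
    folder: str,
    extractor: Callable[[Path], str] = extract_with_pypdf,
) -> Dict[str, str]:
    """Apply `extractor` to every *.pdf file in `folder`, keyed by file name."""
    results: Dict[str, str] = {}
    # sorted() keeps the processing order stable between runs.
    for pdf_path in sorted(Path(folder).glob("*.pdf")):
        results[pdf_path.name] = extractor(pdf_path)
    return results
```

Calling `parse_pdf_folder("my_pdfs")` (a hypothetical folder name) would return a dictionary of file name to extracted text. Passing a different `extractor` lets you plug in an OCR-based function without changing the loop.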