Hi:
I have a PDF which contains regular text and about 10 to 11 tables in between the text and I need to compare the values from these tables to the prior PDF to ensure consistency. Is there a way to extract the tables from the PDF in Alteryx into Excel?
Thanks,
Taki
Solved! Go to Solution.
Hi @Taki
Have you seen the latest release of the Alteryx Intelligence Suite? It allows for text extraction of individual elements from PDFs: https://www.alteryx.com/products/alteryx-platform/intelligence-suite
Hi @Taki
Alteryx has just released a new Intelligence Suite which includes the capability to extract data from images/pdf (and other cool features). Check out the website below, and take note that it comes at an addition cost:
alteryx.com/products/alteryx-platform/intelligence-suite
Otherwise, this is possible with R/Python in Alteryx. Here's some links to check out:
https://gallery.alteryx.com/#!app/PDF-Input--Text-and-Image-/5be5ec8d0462d71ffce6deaa
https://gallery.alteryx.com/#!app/PDF-Input/5b685aff0462d710907f7a3b