Alteryx Designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.
The Expert Exam is now live online! Read about the specifics and what it took to bring it to life in the blog by our very own Elizabeth Bonnell!

PDF to Tabular

Highlighted
6 - Meteoroid

Hi All,

 

Here I share with you a PDF to Tabular Tool to extract tabular data from several pdf files.
This tool is based on the following post:
https://community.alteryx.com/t5/Alteryx-Designer-Discussions/Extracting-Tabular-Data-from-PDF-Docum...

 

In the backend process, I use Camelot package in python which can be seen in the following documentation
https://camelot-py.readthedocs.io/en/master/

 

This tool requires a Row Tolerance and input folder path containing multiple pdf files. In the output anchor, we will provide the tabular data along with Table Number, File Name, and pdf path information.

 

PDF to Tabular.jpg

 

 

 

 

 

 

 

 

 

 

 

Feel free to edit the python code and add new features to the macro. Thank you!!

 

Highlighted
6 - Meteoroid

great work!

Labels