
SOLVED

PDF to text

IoannaT
5 - Atom

I need to extract a specific table from a 30-page PDF file for further data processing.

I have access to the Intelligence Suite if that's helpful for this task.

Can anyone advise if this is feasible using Alteryx, and if so, what steps or tools within Alteryx I should use to accomplish this?

Thank you in advance for your guidance!

2 REPLIES
bryan06
7 - Meteor

You could try:

- PDF to Text: extract the content from the PDF file

- Browse tool: inspect the extracted data to locate your table

- Filter tool: narrow down the data based on a text pattern

- Text To Columns: split the table into separate columns

- Formula tool: clean up any inconsistencies

- Output the results
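
If it helps to see the filter / split / clean logic outside of the Alteryx canvas, here is a rough Python sketch of the same idea. It is not an Alteryx workflow: the sample lines, the row pattern, and the column names (region, units, revenue) are all made up for illustration, and it assumes the PDF text has already been extracted into one string per line. You would need to adapt the pattern to whatever your table rows actually look like.

```python
import re

# Hypothetical sample of lines as they might come out of a PDF text
# extraction step. Your real data will look different.
extracted_lines = [
    "Annual Report 2023",
    "Page 12",
    "Region    Units    Revenue",
    "North     120      1,250.50",
    "South      85        980.00",
    "Notes: figures are unaudited",
]

# Filter step: keep only lines shaped like "<word> <integer> <amount>".
# This pattern is an assumption about the table layout.
ROW_PATTERN = re.compile(r"^\s*[A-Za-z]+\s+\d+\s+[\d,]+\.\d{2}\s*$")


def extract_table(lines):
    """Return the table rows found in `lines` as a list of dicts."""
    rows = []
    for line in lines:
        if not ROW_PATTERN.match(line):
            continue  # not a table row, skip it
        # Split step: break the row into columns on whitespace.
        region, units, revenue = line.split()
        # Clean step: drop thousands separators and convert types.
        rows.append(
            {
                "region": region,
                "units": int(units),
                "revenue": float(revenue.replace(",", "")),
            }
        )
    return rows


table = extract_table(extracted_lines)
for row in table:
    print(row)
```

The header row and the surrounding page text are dropped because they do not match the row pattern, which is the same job the Filter tool does in the steps above.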

 

Hope this helps. Reply if you run into any further issues.

Chaoued
7 - Meteor