Important Community update: The process for changing your account details was updated on June 25th. Learn how this impacts your Community experience and the actions we suggest you take to secure your account here.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

PDF to Excel data export issue

NiranjanK1
8 - Asteroid

Hi All,

 

im trying to export the PDF data as it is to Excel but it is not working as expected. Kindly help me to resolve this issue. I have attached expected Excel Output file.

 

Thanks

Niranjan

23 REPLIES 23
FinnCharlton
13 - Pulsar

@NiranjanK1 , How are you importing the PDF into Alteryx? Can you share the flow / a screenshot of the flow?

NiranjanK1
8 - Asteroid

@FinnCharlton  I tried with all avilable workflows avilable in Alteryx community but im getting error. none of workflows are working. I just tried with forum workflow. i havent build any, not sure where to start. 

Deano478
12 - Quasar

@NiranjanK1 for PDF inputs in general you will need to have Intelligence suit installed for you designer. 

FinnCharlton
13 - Pulsar

@NiranjanK1 , https://community.alteryx.com/t5/Community-Gallery/PDF-Input/ta-p/887038

 

this one has always worked for me, have you tried it? If so, what errors are you getting?

Felipe_Ribeir0
16 - Nebula

Hi @NiranjanK1 

 

You could either use some public gallery tool/alteryx intelligence suite/python/R. The problem that this pdf seems to have is that it does not contains basic references for the tool to parse it, like gridlines and proper alignment. I believe that you would have problems with any selected tool because of this, the tool wouldnt know how to properly separe the column/rows:

 

Felipe_Ribeir0_1-1676549041942.png

So, if you can talk with someone to configure gridlines/proper alignment for these files, it will help a lot.

 

I was able to parse it using python + tabula library with the attached workflow. But as you can see, the tool is not knowing how to do the job properly because of the above commented issues:

 

 

Felipe_Ribeir0_2-1676549489109.png

 

 

 

NiranjanK1
8 - Asteroid

@Felipe_Ribeir0 Thanks very much for your inputs i will talk to them definitly . 

NiranjanK1
8 - Asteroid

@FinnCharlton Sure, i will chec, thank you

NiranjanK1
8 - Asteroid

@Felipe_Ribeir0 It is not working for me, Do i need to install any package

Felipe_Ribeir0
16 - Nebula

Yes, run Alteryx as admin, so this piece of code will be run properly:

 

Felipe_Ribeir0_0-1676558784645.png

 

then change the directory tool to point to the directory that contains the pdf files and run the workflow.

Labels