Hi All,
im trying to export the PDF data as it is to Excel but it is not working as expected. Kindly help me to resolve this issue. I have attached expected Excel Output file.
Thanks
Niranjan
@NiranjanK1 , How are you importing the PDF into Alteryx? Can you share the flow / a screenshot of the flow?
@FinnCharlton I tried with all avilable workflows avilable in Alteryx community but im getting error. none of workflows are working. I just tried with forum workflow. i havent build any, not sure where to start.
@NiranjanK1 for PDF inputs in general you will need to have Intelligence suit installed for you designer.
@NiranjanK1 , https://community.alteryx.com/t5/Community-Gallery/PDF-Input/ta-p/887038
this one has always worked for me, have you tried it? If so, what errors are you getting?
Hi @NiranjanK1
You could either use some public gallery tool/alteryx intelligence suite/python/R. The problem that this pdf seems to have is that it does not contains basic references for the tool to parse it, like gridlines and proper alignment. I believe that you would have problems with any selected tool because of this, the tool wouldnt know how to properly separe the column/rows:
So, if you can talk with someone to configure gridlines/proper alignment for these files, it will help a lot.
I was able to parse it using python + tabula library with the attached workflow. But as you can see, the tool is not knowing how to do the job properly because of the above commented issues:
@Felipe_Ribeir0 Thanks very much for your inputs i will talk to them definitly .
@FinnCharlton Sure, i will chec, thank you
@Felipe_Ribeir0 It is not working for me, Do i need to install any package
Yes, run Alteryx as admin, so this piece of code will be run properly:
then change the directory tool to point to the directory that contains the pdf files and run the workflow.