Hi everyone,
I have a question. Can Alteryx Designer with Intelligence Suite tools installed parse PDF or report files? In the attached PDF example, I would like to create separate fields for all of the fields in this report (Product Number, Product Description, Unit Cost, etc.) and have it read these fields for all of the records in the PDF file. The approach I am thinking of would be similar to what Monarch or IDEA does.
I am not sure if this functionality exists in Alteryx. Any help or assistance with this would be greatly appreciated. I am attaching an example PDF file.
Thanks in advance!
Mike
You can do this in Intelligence Suite, or in R or Python. You're going to need to think about your logic buckets and how you decide which values get allocated where (knowing Regex helps), but it's possible. Play around with it!
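To give a rough idea of the Regex route: once the report text has been extracted line by line, a pattern can pick out the detail rows and skip headers and footers. The sketch below is only an illustration. The line layout (product number, description, and unit cost separated by two or more spaces) is an assumption, not taken from the attached PDF, and `LINE_PATTERN` and `parse_report_lines` are made-up names.

```python
import re

# Assumed layout of a detail row (hypothetical, adjust to the real report):
#   <product number>  <product description>  <unit cost>
# with columns separated by two or more spaces.
LINE_PATTERN = re.compile(
    r"^(?P<product_number>\S+)\s{2,}"
    r"(?P<product_description>.+?)\s{2,}"
    r"(?P<unit_cost>[\d,]+\.\d{2})\s*$"
)


def parse_report_lines(lines):
    """Return one dict per line that looks like a detail row; skip everything else."""
    records = []
    for line in lines:
        match = LINE_PATTERN.match(line.strip())
        if match is None:
            continue  # header, footer, or blank line
        record = match.groupdict()
        # Drop thousands separators before converting the cost to a number.
        record["unit_cost"] = float(record["unit_cost"].replace(",", ""))
        records.append(record)
    return records


sample_lines = [
    "Product Number  Product Description  Unit Cost",
    "A1001   Widget, large   1,234.50",
    "Page 1 of 3",
]
records = parse_report_lines(sample_lines)
```

With the sample lines above, only the middle line should come through as a record. The same pattern could probably be adapted for the RegEx tool in Designer, but that would need testing against the real report.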
Hi @mkeiffer
I like to use the tabula Python library to read tables from PDF files. It's easy, free, and it works well. The attached example already parses your PDF file.
1-Close Alteryx and run it again as an administrator so that it is able to install the libraries.
2-Change the input parameter of the Directory tool to the folder that contains your PDF files.
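For anyone who cannot open the workflow, here is a minimal sketch of what reading a folder of PDFs with tabula can look like in a Python tool. It is not the code from the attached workflow. The helper names (`list_pdf_files`, `read_pdf_tables`, `parse_folder`) are invented for illustration, and it assumes the `tabula-py` package and a Java runtime are installed.

```python
from pathlib import Path


def list_pdf_files(folder):
    """Return the PDF files in `folder`, sorted by path."""
    return sorted(p for p in Path(folder).iterdir() if p.suffix.lower() == ".pdf")


def read_pdf_tables(pdf_path):
    """Return a list of pandas DataFrames, one per table tabula detects."""
    # Imported here so the helpers above still load if tabula-py is missing.
    import tabula  # assumption: installed with `pip install tabula-py`

    # pages="all" scans every page; multiple_tables=True keeps tables separate.
    return tabula.read_pdf(str(pdf_path), pages="all", multiple_tables=True)


def parse_folder(folder):
    """Map each PDF file name in `folder` to the tables found in it."""
    return {p.name: read_pdf_tables(p) for p in list_pdf_files(folder)}
```

Calling `parse_folder` with the folder that holds your PDF files gives back the tables per file. How cleanly the columns come out depends on the layout of the report, so expect to rename or clean up some fields afterwards.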
@apathetichell and @Felipe_Ribeir0 - Thank you both for taking the time to answer my question. Both answers were extremely helpful! I love being a part of this online Community. Have a great weekend!
@mkeiffer if you try to do it with the Intelligence Suite, please share your impressions here. I would like to hear from another user's perspective how it is done there, whether it works well, and whether it is easy to configure.
Have a great week!
@Felipe_Ribeir0 I will certainly do that! Thank you again so much for your first response and your assistance with this. It is very much appreciated!
Mike