read multiple PDF files in Alteryx and convert (output) them to Excel workbook?
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Hi Community,
I have a batch of 50+ PDF files that I would like to convert to Excel workbook.
keeping the same name for each file.
Is it possible using Alteryx?
so 50PDF's -->> 50 excel files
any help/support would be much appreciated.
- Labels:
- Documentation
- Settings
- Updates
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Notify Moderator
Yes, can be done, but you'll need an external tool to capture the contents of the PDF.
Maybe you can use some R or Python packages to get the contents from within the PDF files first.
This macros can give you a good starting point:
https://gallery.alteryx.com/#!app/PDF-Input--Text-and-Image-/5be5ec8d0462d71ffce6deaa
https://gallery.alteryx.com/#!app/PDF-Input/5b685aff0462d710907f7a3b
Hope this helps pointing you in the right direction.
Best
_AG_
PS: I used PDF2Text for a similar project, so it can be helpful too.
