Alteryx Server Discussions

Find answers, ask questions, and share expertise about Alteryx Server.

read multiple PDF files in Alteryx and convert (output) them to Excel workbook?

gurpreet-singh
7 - Meteor

Hi Community, 

 

I have a batch of 50+ PDF files that I would like to convert to Excel workbook. 

keeping the same name for each file. 

 

Is it possible using Alteryx?

 

so 50PDF's -->> 50 excel files

 

any help/support would be much appreciated. 

1 REPLY 1
Aguisande
15 - Aurora
15 - Aurora

Hi @gurpreet-singh 

Yes, can be done, but you'll need an external tool to capture the contents of the PDF.

 

Maybe you can use some R or Python packages to get the contents from within the PDF files first.

 

This macros can give you a good starting point: 

https://gallery.alteryx.com/#!app/PDF-Input--Text-and-Image-/5be5ec8d0462d71ffce6deaa

https://gallery.alteryx.com/#!app/PDF-Input/5b685aff0462d710907f7a3b

 

Hope this helps pointing you in the right direction.

Best

_AG_

 

PS: I used PDF2Text for a similar project, so it can be helpful too.