ACT NOW: The Alteryx team will be retiring support for Community account recovery and Community email-change requests Early 2026. Make sure to check your account preferences in my.alteryx.com to make sure you have filled out your security questions. Learn more here
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

parsing pdf or doc files

Keerthana_Adamana
6 - Meteoroid

hi,

I would like to parse a folder which contains both pdf and doc file types(for resume parsing). Is there any way i can achieve?

2 REPLIES 2
bertal34
9 - Comet

@Keerthana_Adamana 

 

PDF parsing will require an AIS license which includes the Computer Vision tools. For ms word documents, check out the thread below.  I was able to take the "Docx Input" macro from @RogerS and tweak it for my use case.  To input multiple docx files, you can place this macro inside of a batch macro allowing you to feed in multiple file paths and output data from all files.

 

https://community.alteryx.com/t5/Alteryx-Designer-Desktop-Discussions/Input-Data-from-Word-document-...

 

alexnajm
19 - Altair
19 - Altair

To add onto @bertal34 's note, if you don't have the Intelligence Suite tools then there's a macro that leverages R to read in PDFs on the Community: PDF Input - Alteryx Community

Labels
Top Solution Authors