Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Reading multiple pdf files with diffrent formats

MAYANK_F
5 - Atom

Hi everyone i want to build an iterative macro to input multiple pdf files with inconsistent format and then output certain records that are common in that but i am not able to get a starting point around this use case.

1 REPLY 1
griffinwelsh
12 - Quasar

@MAYANK_F This depends on how inconsistent your data is. If the files are sometimes image based text you will need to use OCR otherwise you can just fetch the text. Either way you need to use Python with a package like pymupdf or the computer vision tools from intelligence suite.

Labels