Hello,
I am trying to use the PDF input tool for the first time. The workflow so far appears like the below:
It has been sitting on 0% across all tools for around 20mins. Not sure if this is relevant in the slightest but on task manager tesseract.exe is nailing most of the CPU %.
The directory only has 2 PDFs held within so I'm not sure why it would be taking so long, the subsequent tools aren't doing very much other than basic filtering. Any help greatly appreciated.
Kind regards
Solved! Go to Solution.
Would you please try to use the PDF input building block directly to point Alteryx to the folder where your PDFs are saved?
Cheers!
Many thanks for your response, I placed the filter for 'page 1' prior to the 'Image to text' tool which significantly reduced the processing time (2 minutes).