Hi, I'm testing Intelligence Suite to read data from PDF to excel. Even with clear PDF, some information is not correctly being pulled. Examples are like:
1. the date should be 10/27/2025, it will pull as 40/27/2025.
2. It will read 5 as S.
3. If there are 2 S next to each other in a 17 digits' number, it will add 8 after the 2 S and makes it a 18 digits' number.
How to correct those mis-read information?
4. If a row information was input shown as 2 rows' information, everything after that will move down one row even though the layout in PDF looks the same and the other highlighted annotations will be screwed up. How to align it in Alteryx?
Thanks!
Theresa
Hi there!
I used the computer vision tools in a training session at Alteryx Inspire 2025.
I noticed that of the 4 sample pdfs we were using, there was one value that did not load correctly. Seems like this is the same issue you are facing.
I asked the presenter about it, he said "Good catch."
At this point, I believe this is a bug. I would recommend reporting the error.
Thank you for the suggestion. I've just sent it to our account manager to see if they have solution to fix those issues. Thanks!
I have had a lot of luck using the PDF Input tool below. This includes reading dates and values properly from PDF invoices. Give it a try!
PDF Input Tool