Community Spring Cleaning week is here! Join your fellow Maveryx in digging through your old posts and marking comments on them as solved. Learn more here!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

OCR

Idyllic_Data_Geek
8 - Asteroid

Can some one help me to create a workflow to read couple of fields from a scanned letters? The files are .pdf extension. I'm relatively new to Python so any help is greatly appreciated. I can not use R as I'm having issues with company deployed R package for alteryy in my work. Thnk you

8 REPLIES 8
fmvizcaino
17 - Castor
17 - Castor

Hi @Idyllic_Data_Geek ,

 

Here is a link for a pdf tool that is capable of reading images.

https://gallery.alteryx.com/#!app/PDF-Input--Text-and-Image-/5be5ec8d0462d71ffce6deaa

 

You will need to open your alteryx designer as an admin and may need to install some packages as well. Here you can check how to install packages.

https://community.alteryx.com/t5/Alteryx-Designer-Knowledge-Base/How-To-Use-Alteryx-installPackages-...

 

Best,

Fernando Vizcaino

mutama
Alteryx
Alteryx

Hi @Idyllic_Data_Geek ,

 

Have you tried the Alteryx Computer Vision Tools? 

 

You can see here for more info: https://community.alteryx.com/t5/Data-Science/Unlocking-Insights-from-Images-using-Computer-Vision/b...

 

Best,

Michael

Idyllic_Data_Geek
8 - Asteroid

Idyllic_Data_Geek_0-1627485604245.png

When I download the workflow then it is showing me the above. How do I get the tool?

Idyllic_Data_Geek
8 - Asteroid

Is this the tool? WHy is it giving me error? Do I need to configure something? I simply opened a pdf file and connected the view...It is giving me the below error

Idyllic_Data_Geek_0-1627485796690.png

 

fmvizcaino
17 - Castor
17 - Castor

Hi @Idyllic_Data_Geek ,

 

You need to open your alteryx as an administrator to allow the tools to install the proper packages. You can do it by right-clicking the Alteryx icon.

 

I would suggest you trying the computer vision tools as @mutama mentioned. It is part of an additional package and maybe you need to request the account manager for an additional trial license, but I can say that it is a great package and it will become a lot easier for you to tackle OCR problems.

 

Best,

Fernando Vizcaino

Idyllic_Data_Geek
8 - Asteroid

What are the proper packages? No my firm won;t upgrade to the latest or additional features at a premium cost!

fmvizcaino
17 - Castor
17 - Castor

Hi @Idyllic_Data_Geek ,

 

The PDF Reader is R-based and needs to install 2 packages, PDF tools and tesseract.

 

Best,

Fernando Vizcaino

 

Idyllic_Data_Geek
8 - Asteroid

I already installed both these tools! How can I validate my installation, please?

Labels