Advent of Code is now back for a limited time only! Complete as many challenges as you can to earn those badges you may have missed in December. Learn more about how to participate here!
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

OCR

Idyllic_Data_Geek
Asteroide

Can some one help me to create a workflow to read couple of fields from a scanned letters? The files are .pdf extension. I'm relatively new to Python so any help is greatly appreciated. I can not use R as I'm having issues with company deployed R package for alteryy in my work. Thnk you

8 RESPOSTAS 8
fmvizcaino
17 - Castor
17 - Castor

Hi @Idyllic_Data_Geek ,

 

Here is a link for a pdf tool that is capable of reading images.

https://gallery.alteryx.com/#!app/PDF-Input--Text-and-Image-/5be5ec8d0462d71ffce6deaa

 

You will need to open your alteryx designer as an admin and may need to install some packages as well. Here you can check how to install packages.

https://community.alteryx.com/t5/Alteryx-Designer-Knowledge-Base/How-To-Use-Alteryx-installPackages-...

 

Best,

Fernando Vizcaino

mutama
Alteryx Alumni (Retired)

Hi @Idyllic_Data_Geek ,

 

Have you tried the Alteryx Computer Vision Tools? 

 

You can see here for more info: https://community.alteryx.com/t5/Data-Science/Unlocking-Insights-from-Images-using-Computer-Vision/b...

 

Best,

Michael

Idyllic_Data_Geek
Asteroide

Idyllic_Data_Geek_0-1627485604245.png

When I download the workflow then it is showing me the above. How do I get the tool?

Idyllic_Data_Geek
Asteroide

Is this the tool? WHy is it giving me error? Do I need to configure something? I simply opened a pdf file and connected the view...It is giving me the below error

Idyllic_Data_Geek_0-1627485796690.png

 

fmvizcaino
17 - Castor
17 - Castor

Hi @Idyllic_Data_Geek ,

 

You need to open your alteryx as an administrator to allow the tools to install the proper packages. You can do it by right-clicking the Alteryx icon.

 

I would suggest you trying the computer vision tools as @mutama mentioned. It is part of an additional package and maybe you need to request the account manager for an additional trial license, but I can say that it is a great package and it will become a lot easier for you to tackle OCR problems.

 

Best,

Fernando Vizcaino

Idyllic_Data_Geek
Asteroide

What are the proper packages? No my firm won;t upgrade to the latest or additional features at a premium cost!

fmvizcaino
17 - Castor
17 - Castor

Hi @Idyllic_Data_Geek ,

 

The PDF Reader is R-based and needs to install 2 packages, PDF tools and tesseract.

 

Best,

Fernando Vizcaino

 

Idyllic_Data_Geek
Asteroide

I already installed both these tools! How can I validate my installation, please?

Rótulos
Autores com maior número de soluções