Alteryx IO Discussions


OCR Python SDK Tool (Two page limit)

Richie059
7 - Meteor

Good morning guys, and thank you in advance for the help! I'm new to the Python SDK and I'm having serious difficulty with a tool developed by tlarsen7572 in this community post: https://community.alteryx.com/t5/Dev-Space/Input-PDFs-they-are-images/td-p/404360/page/2 (the first reply on page two contains the SDK tool). Here is his link to the GitHub folder containing the code for the OCR tool: https://github.com/ayx-builders/AlteryxTools/tree/master/OCR/OCR

In essence, the tool interfaces with Microsoft Azure's cloud-based processing services, more specifically a service called "Computer Vision". The tool sends images and PDFs to the Azure endpoint, waits for Azure to process the files, and then downloads and parses the results.

The problem I'm having is that a two-page limit seems to be "built into" the code (intentionally or unintentionally): only the first two pages or images of a file come back. I've been through all the .py files and cannot work out why this limitation is occurring. Any help whatsoever would be greatly appreciated. Thank you so much!
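For readers following along, the submit, wait, and parse flow described above can be sketched roughly as below. This is not the code from the linked OCR tool. The function names are hypothetical, the status values ("Running", "Succeeded", "Failed") and the `recognitionResults` / `lines` / `text` response shape are assumptions based on the Computer Vision 2.1 Read API, and the network call is replaced by a stand-in so the example runs offline.

```python
import time
from typing import Callable, Dict, List


def poll_until_done(fetch_status: Callable[[], Dict],
                    interval_seconds: float = 1.0,
                    max_attempts: int = 30) -> Dict:
    """Call fetch_status() repeatedly until the service reports a final status.

    fetch_status is a hypothetical callable that would normally issue the GET
    request against the operation URL returned when the file was submitted.
    """
    for _ in range(max_attempts):
        result = fetch_status()
        # Status names are an assumption about the service's response.
        status = str(result.get("status", "")).lower()
        if status == "succeeded":
            return result
        if status == "failed":
            raise RuntimeError("OCR operation failed")
        time.sleep(interval_seconds)
    raise TimeoutError("OCR operation did not finish in time")


def extract_lines(result: Dict) -> List[str]:
    """Flatten the recognized text lines from every page in the result."""
    lines = []
    # Assumed shape: one entry per page, each holding a list of text lines.
    for page in result.get("recognitionResults", []):
        for line in page.get("lines", []):
            lines.append(line.get("text", ""))
    return lines


# Stand-in for the real service: first poll is still running, second is done.
fake_responses = iter([
    {"status": "Running"},
    {"status": "Succeeded",
     "recognitionResults": [{"page": 1, "lines": [{"text": "Hello"}]}]},
])
finished = poll_until_done(lambda: next(fake_responses), interval_seconds=0)
print(extract_lines(finished))
```

Passing the fetch function in as a parameter keeps the polling logic separate from the HTTP details, which makes it easier to see whether a limit is applied in the client code or in the response itself.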

1 REPLY
tlarsen7572
11 - Bolide

Unfortunately, the free tier of Computer Vision API 2.1 only reads the first 2 pages of a document:

[Image: tlarsen7572_0-1667476076064.png]

You can read more about the API here:

https://westus.dev.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/2afb498...
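One way to confirm that the limit comes from the service rather than from the tool's .py files is to compare the number of pages in the raw response with the number of pages in the source document. Here is a minimal sketch, assuming the response carries a `recognitionResults` list with one entry per page. The function names are hypothetical and are not part of the OCR tool.

```python
from typing import Dict


def pages_returned(result: Dict) -> int:
    """Count the pages present in a parsed OCR response.

    Assumes one entry per page under "recognitionResults".
    """
    return len(result.get("recognitionResults", []))


def looks_truncated(result: Dict, expected_pages: int) -> bool:
    """Return True when the response has fewer pages than the source file."""
    return pages_returned(result) < expected_pages


# Illustrative response for a five-page PDF where only two pages came back.
sample_result = {
    "status": "Succeeded",
    "recognitionResults": [
        {"page": 1, "lines": []},
        {"page": 2, "lines": []},
    ],
}
print(looks_truncated(sample_result, expected_pages=5))
```

If the raw response already stops at two pages, the cutoff happens before the tool parses anything, which points at the pricing tier rather than the code.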