Alteryx IO Discussions

Richie059 · ‎10-27-2022

Good morning guys, and thank you in advance for the help! I'm new to the Python SDK and I'm having serious difficulties with a tool developed by tlarsen7572 in this community post: https://community.alteryx.com/t5/Dev-Space/Input-PDFs-they-are-images/td-p/404360/page/2 (the first reply on page two contains the SDK tool). Here is his link to the GitHub folder containing the code for the OCR tool: https://github.com/ayx-builders/AlteryxTools/tree/master/OCR/OCR . In essence, the tool interfaces with Microsoft Azure (cloud-based processing services), more specifically a service called "Computer Vision": the tool sends images and PDFs to the Azure endpoint, waits for Azure to process the files, and then downloads and parses the results. The problem I'm having is that there seems to be a two-image limit "built into" the code (intentionally or unintentionally). I've been through all the .py files and cannot work out why this limitation is occurring. Any help whatsoever would be greatly appreciated. Thank you so much!
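
For readers unfamiliar with the pattern, the send / wait / download flow described above can be sketched roughly as below. This is only an illustration, not the code from the linked repository: `submit_and_wait`, the `submit` and `poll` callables, and the status strings are all hypothetical stand-ins, and no real Azure call is made.

```python
import time


def submit_and_wait(submit, poll, interval_seconds=1.0, max_attempts=30,
                    sleep=time.sleep):
    """Submit a job, then poll until it reports a terminal status.

    `submit` returns a job id; `poll(job_id)` returns a dict with a
    "status" key. Both are hypothetical stand-ins for the real HTTP calls.
    """
    job_id = submit()
    for _ in range(max_attempts):
        result = poll(job_id)
        # Status names here are assumptions, not taken from the tool.
        status = str(result.get("status", "")).lower()
        if status == "succeeded":
            return result
        if status == "failed":
            raise RuntimeError(f"job {job_id} failed")
        sleep(interval_seconds)
    raise TimeoutError(f"job {job_id} did not finish after {max_attempts} polls")


# Fake service for demonstration: first poll is still running, second is done.
_responses = iter([
    {"status": "Running"},
    {"status": "Succeeded", "pages": [1, 2]},
])
demo_result = submit_and_wait(
    submit=lambda: "job-1",
    poll=lambda job_id: next(_responses),
    sleep=lambda seconds: None,  # skip real waiting in the demo
)
print(demo_result)
```

The point of the sketch is that the client only relays whatever the service returns once the job finishes, so a cap on the number of pages does not have to live anywhere in the client's .py files.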

tlarsen7572 · ‎11-03-2022

Unfortunately, the free tier of Computer Vision API 2.1 only reads the first 2 pages of a document.

You can read more about the API here:

https://westus.dev.cognitive.microsoft.com/docs/services/5cd27ec07268f6c679a3e641/operations/2afb498...
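
If you want to confirm that the cut-off is happening on the service side rather than in the tool, a quick check along these lines may help. It is a minimal sketch that assumes the parsed JSON result holds one entry per page under a `recognitionResults` key, each with a `page` number; that key name, the helper names, and the sample data are assumptions for illustration, so adjust them to whatever your actual response looks like.

```python
def pages_returned(result, pages_key="recognitionResults"):
    """Return the page numbers present in a parsed OCR result dict.

    `pages_key` is an assumption about the response layout; change it if
    your response stores the per-page entries under a different key.
    """
    return [entry.get("page") for entry in result.get(pages_key, [])]


def looks_truncated(result, expected_pages, pages_key="recognitionResults"):
    """True if the service returned fewer pages than the document contains."""
    return len(pages_returned(result, pages_key)) < expected_pages


# Made-up sample shaped like a response for a 5-page PDF on a capped tier.
sample = {
    "status": "Succeeded",
    "recognitionResults": [
        {"page": 1, "lines": []},
        {"page": 2, "lines": []},
    ],
}

print(pages_returned(sample))
print(looks_truncated(sample, expected_pages=5))
```

If the raw response itself only contains two pages for a longer document, the limit is coming from the API tier and not from the Python code.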


OCR Python SDK Tool (Two page limit)