Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.
Announcement | We'll be doing maintenance between 2-3 hours, which may impact your experience. Thanks for your patience as we work on improving the community!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

How to rotate the text in a PDF file, so as the text to be read by Alteryx properly

EveSt
5 - Atom

Hello All,

 

I have been researching the community discussions for my case for months and as it turns out I am the first that have this obstacle. The problem is in the layout of the page in a PDF document and more specifically the orientation of the text on the page. I tried the customized tools: PDF Input (with R-based macro) , PDF Input Text-and-Image, and from Text Mining in Alteryx Intelligence Suite PDF Input and Image to Text so as to read PDF document with different text orientation. The text, situated horizontally and is readable from left to right is correct, but when the page is rotated vertically and the text starts from the bottom to the top, then this text is not read properly in a comprehensible way.

 

EveSt_2-1619622319716.png

 

The pages in the PDF document looks like this way:

EveSt_1-1619622240935.png

 

I would appreciate any idea and some guidance to find a solution for this problem. I suppose a customized tool, using Python is needed.

 

Many thanks in advance for every cooperation

 

2 REPLIES 2
TrevorS
Alteryx Alumni (Retired)

Hello @EveSt 
Thanks for posting on the Community!
I looked into this with some of my support team and it doesn't appear that this is currently possible to do within Designer.
Right now the recommendation is to make the changes to the PDF so the text is correct before loading them into Alteryx.
You can also submit an idea for this if you would like to see this as a potential feature enhancement.
Thanks,
TrevorS

Community Moderator
Xervarian
6 - Meteoroid

This would be a big WIN if Alteryx could provide rotate functionality within the image template tool within the computer vision suite: or even better, could the OCR read vertical tables :) ALTERYX FTW

Labels