Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Reading Scanned document with Merged Cells

Hamza_Abuali
7 - Meteor

I am trying to read a scanned PDF (I know this is a challenging, but it's not possible), like the table below:


Attached.png












I have successfully read the IDs and the Case numbers, but unfortunately, there is no fixed template for the Case # column, it could be  3,4,5, or even 10 people sharing the same case .
In alteryx, the data is only structured, so as an example, the Case number:11234/2024 would fill in line1, line 2, or it generates a third line and put the Case number and, in the other is null , like this :

IDCase #
111111234/2024
nullnull
2222null

or it could be like this :

IDCase #
1111null
null11234/2024
2222null


or like this:

IDCase #
1111null
nullnull
222211234/2024

 


there are lots of options, I think you got the point now.
What's the best possible way to map each ID with his Case??

1 REPLY 1
shancmiralles
11 - Bolide

@Hamza_Abuali  ,

 

this is quiet a challenge ..  from my end.. i'd rather fix this first in excel before transferring it on alteryx. 

Labels