Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Reading Scanned document with Merged Cells

Hamza_Abuali
7 - Meteor

I am trying to read a scanned PDF (I know this is a challenging, but it's not possible), like the table below:


Attached.png












I have successfully read the IDs and the Case numbers, but unfortunately, there is no fixed template for the Case # column, it could be  3,4,5, or even 10 people sharing the same case .
In alteryx, the data is only structured, so as an example, the Case number:11234/2024 would fill in line1, line 2, or it generates a third line and put the Case number and, in the other is null , like this :

IDCase #
111111234/2024
nullnull
2222null

or it could be like this :

IDCase #
1111null
null11234/2024
2222null


or like this:

IDCase #
1111null
nullnull
222211234/2024

 


there are lots of options, I think you got the point now.
What's the best possible way to map each ID with his Case??

1 REPLY 1
shancmiralles
11 - Bolide

@Hamza_Abuali  ,

 

this is quiet a challenge ..  from my end.. i'd rather fix this first in excel before transferring it on alteryx. 

Labels
Top Solution Authors