
Alteryx Designer Desktop Discussions


PDF tools - parse data

luhaatyrau
5 - Atom

Hi,

I have attached a file with two tabs. The first tab contains the data that needs to be parsed. The second tab shows the format I expect to see.

I don't know how to parse the records that were split across two lines.

Your help would be greatly appreciated.

Mary

1 REPLY
cgoodman3
14 - Magnetar

This will be challenging to do dynamically, that is, in a way that still works the next time you read in a similar document. There's a lot of extra white space that needs to be cleaned up.
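If you end up doing the cleanup in code (for example in a Python tool), a minimal sketch of the white space step might look like this. The function name and the sample string are made up for illustration and are not taken from the attached file.

```python
import re

def clean_whitespace(text):
    """Collapse runs of spaces, tabs, and non-breaking spaces into one space."""
    # \s matches Unicode white space in Python 3, including \u00a0
    return re.sub(r"\s+", " ", text).strip()

# Hypothetical value with the kind of padding PDF extraction tends to leave behind
cleaned = clean_whitespace("  Acme   Corp \t Ltd\u00a0 ")
```

This only normalizes spacing inside a single field. It does not rejoin records that were split across two lines.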

Depending on how the data is laid out in the PDF (for example, whether it is already in a table), I'd suggest using the table detection option in the Image Template tool. This gives you an output field with the columns delimited by a '|' (pipe), which makes parsing easier and more consistent.
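As a rough sketch of why the pipe-delimited output is easier to work with, here is how such rows could be split into columns in Python. The sample rows, column names, and function name are assumptions for illustration only. The real output depends on your document.

```python
def split_pipe_rows(lines):
    """Split each pipe-delimited line into a list of trimmed cell values."""
    rows = []
    for line in lines:
        if not line.strip():
            continue  # skip empty lines
        rows.append([cell.strip() for cell in line.split("|")])
    return rows

# Hypothetical rows, standing in for the table detection output
sample = [
    "Invoice | Date | Amount",
    "1001 | 2024-01-05 | 250.00",
    "",
    "1002 | 2024-01-09 | 75.50",
]

# Treat the first row as the header and pair it with each record
header, *records = split_pipe_rows(sample)
table = [dict(zip(header, record)) for record in records]
```

Inside a workflow, splitting the field on the '|' delimiter with the Text To Columns tool should get you to the same place without code.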


https://community.alteryx.com/t5/Data-Science/Unlocking-Insights-from-Images-using-Computer-Vision/b...

Chris