Join the Inspire AMA with Joshua Burkhow, March 31-April 4. Ask, share, and connect with the Alteryx community!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Parsing Data with Regex

cherylwinschuh
6 - Meteoroid

Hello,

 

I am working with data that comes in from PDF using the PDF input tool. (not the auto insights PDF tool) 

 

Using regex I was able to break the data into columns. I am now running into an issue where some of the elements I used are not in the data.

example. I searched the data for "CP" and replaced it with |CP.  if the line does not have CP, it's throwing off my columns.

(\s[C]+[P]) replaced with |CP

 

I tried adding a | as a delimiter where there were 2 or more spaces, some lines do not have 2 spaces between "columns"

\s{2}\b replaced with |

 

The PO number and Invoice can vary, some do not have letters in front of them.

 

Does anyone know of a better formula/regex to parse the data and not have misalignment due to missing information?

 

thanks

Cheryl

 

1 REPLY 1
QuentinS
Alteryx
Alteryx

hi @cherylwinschuh ,

 

this would be my approach, please see the attached workflow.

Labels
Top Solution Authors