We are celebrating the 10-year anniversary of the Alteryx Community! Learn more and join in on the fun here.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Removing text from Cell that does not fit pattern

jzlamal
5 - Atom

I have an Excel file that has additional commentary added to Cells and multiple values that I need to capture also in the same cell.  The value of a single cell in the spreadsheet may look something like:

This is my comment number 1

This is my comment number 2 

John:Silver

This is. my comment number 3

Andrew:Gold

This is my comment number 4

 

What I am trying to extract from this cell is:

 

John:Silver

Andrew:Gold

 

I have been reading about text to column parsing but that seems to expect the data to be more structured.   I am currently experimenting with the RegEx parsing option, but I am not sure it would return the multiple values in the cell.   Has anyone done this already?   

 

Thank you!

John

 

3 REPLIES 3
binuacs
21 - Polaris

@jzlamal One way of doing this with the Regex_Tokenize method

binuacs_0-1679606774901.png

 

gaoa
11 - Bolide

Hi @jzlamal 

Another way is to use text to column tool with the configuration below, then filter out the lines that contain the word "comment".

gaoa_0-1679607255220.png

gaoa_1-1679607437946.png

jzlamal
5 - Atom

Thanks!   This solution provided the exact behavior I was seeking.   Appreciate the help!

Labels
Top Solution Authors