Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.
Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Rookie Question- How to delete extra characters

leo2017
5 - Atom

How to remove unwanted characters in  the string 1,285<sup id="cite_ref-4" class="reference"><a href="#cite_note-4">[4]</a></sup></td>

so that I get only 1285 in my cleaned data.

 

What function do I use?

 

Thanks

2 REPLIES 2
KaneG
Alteryx Alumni (Retired)

You have a couple of options.

 

The simplest to work in your situation is:

Replace(Left([Text],5),',','')

 

which will take the left 5 and then replace the comma with nothing. A more dynamic way would involve making the number part dynamic by using Findstring:

Replace(Left([Text],FindString([text],'<')),',','')

 

You could also do it with REGEX, allowing you to add in other patterns easily later. The 3 formulas that you would want are:

REGEX_Replace([text],'\[.*?\]','')  // Replaces square brackets and anything in between with nothing
REGEX_Replace([text],'[[:punct:]]','') // Replaces any punctuation with nothing
REGEX_Replace([text],'<.*?>',''// Replaces triangle brackets and anything in-between with nothing

And so nested together would be:

REGEX_Replace(
REGEX_Replace(
REGEX_Replace([text],'<.*?>','')
,'\[.*?\]','')
,'[[:punct:]]','')

 

leo2017
5 - Atom

Thank you! problem solved.

Labels
Top Solution Authors