Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

RegEx Question

XxparsegodxX
5 - Atom

Hi Alteryx Community!

 

I am currently using the following RegEx expression to extract only words. I.e. to take out groupings of words until the phrase hits a digit. (\D+)

 

Input: Example (Example) Text ($M) (7) 235,547 225,788 218,227 4 8

 

However, the RegEx doesn't work when there is an unwanted brackets number i.e. the (7)

Current Output: Assets Under Management (AUM) - average ($M) (

 

Desired Output: Assets Under Management (AUM) - average ($M)

 

I then use a Right & Length formula to extract the numbers from the cell.

 

Let me know if any of the above doesn't make sense and I can help clarify...thanks everyone!! (first time posting on alteryx community so super excited :D)

 

 

4 REPLIES 4
estherb47
15 - Aurora
15 - Aurora

Hi @XxparsegodxX 

 

Your RegEx statement is working, in that it's stopping as soon as it finds a digit, which is after the open parenthesis.

 

Does the rest of your data follow the same pattern? That is, numbers in brackets followed by groups of numbers? If so, I'd use a Formula tool with the following 2 expressions:

 

First: Regex_Replace([TextField],"\d","") to replace all of the numbers with nothing

 

Then: Replace([TextField],"()","") to get rid of the empty parentheses.

 

A Data Cleansing tool can get rid of all of the extra spaces.

 

I'm sure there's a more elegant solution, but my travel-addled brain isn't coming up with one at the moment.

 

Let me know if that helps.

 

Cheers!

Esther

fmvizcaino
17 - Castor
17 - Castor

Hi @XxparsegodxX ,

 

For that, I think you can use the following regular expression: ([^\d]+)\s.*?\d

 

Best,

Fernando Vizcaino

 

XxparsegodxX
5 - Atom

Hi @fmvizcaino

 

This is just what I was looking for! Can you explain the logic please? I'm not great with regex and would love to learn

fmvizcaino
17 - Castor
17 - Castor

Sure @XxparsegodxX !

 

\s.*?\d

 

\s.*     This means to identify an space followed by any character multiple times (in our case, it could be .{0,1} which means any character 0 or 1 time

?\d     Until it finds the first digit

 

Best,

Fernando Vizcaino

Labels