Alteryx Designer Discussions

Find answers, ask questions, and share expertise about Alteryx Designer.
Andy Uttley, Alteryx ACE, makes music with Alteryx | Math + Music
SOLVED

Regular expression

Highlighted

Hi

I am new in Alteryx and have a problem creating the right regular expression.

 

I have a list of adresses. I have used this regular expressen: \d+\s.|\d+ (tokenize)

to get the "Husnr1" column. 

 

adresse1Husnr1vejnavn1vejnavn2vejnavn3
Egehegnet 2 C2 CEgehegnet  
Mariendalsvej 50 F,4 tv50 FMariendalsvejtv 
Dronninggårds Alle 138138DronninggårdsAlle 

 

For vejnavn1, vejnavn2 and vejnavn3 I have used this expression: 

\<\u\l+\> (tokenize). 

 

What I would like to get as an output is this:

 

adresse1Husnr1vejnavn1vejnavn2vejnavn3
Egehegnet 2 C2 CEgehegnet  
Mariendalsvej 50 F,4 tv50 FMariendalsvej  
Dronninggårds Alle 138138DronninggårdsAlle 

 

I want everything after the comma (eg 4 tv) to be removed. 

 

Thanks for your help in advance.

 

Regards, Trine

Highlighted
Alteryx Certified Partner
Alteryx Certified Partner

To remove everything after the comma you could just use a split-to-columns on comma delimiter.

 

I think your regular expression is correct. Just untick the Case Insensitive box. This will ensure that you tokenize on a word starting with a capital letter.

Labels