Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Regular expression

Hi

I am new in Alteryx and have a problem creating the right regular expression.

 

I have a list of adresses. I have used this regular expressen: \d+\s.|\d+ (tokenize)

to get the "Husnr1" column. 

 

adresse1Husnr1vejnavn1vejnavn2vejnavn3
Egehegnet 2 C2 CEgehegnet  
Mariendalsvej 50 F,4 tv50 FMariendalsvejtv 
Dronninggårds Alle 138138DronninggårdsAlle 

 

For vejnavn1, vejnavn2 and vejnavn3 I have used this expression: 

\<\u\l+\> (tokenize). 

 

What I would like to get as an output is this:

 

adresse1Husnr1vejnavn1vejnavn2vejnavn3
Egehegnet 2 C2 CEgehegnet  
Mariendalsvej 50 F,4 tv50 FMariendalsvej  
Dronninggårds Alle 138138DronninggårdsAlle 

 

I want everything after the comma (eg 4 tv) to be removed. 

 

Thanks for your help in advance.

 

Regards, Trine

1 REPLY 1
PhilipMannering
16 - Nebula
16 - Nebula

To remove everything after the comma you could just use a split-to-columns on comma delimiter.

 

I think your regular expression is correct. Just untick the Case Insensitive box. This will ensure that you tokenize on a word starting with a capital letter.

Labels