
SOLVED

Regular expression

Hi

I am new to Alteryx and am having trouble creating the right regular expression.

 

I have a list of addresses. To get the "Husnr1" column, I used this regular expression (tokenize): \d+\s.|\d+

 

| adresse1 | Husnr1 | vejnavn1 | vejnavn2 | vejnavn3 |
|---|---|---|---|---|
| Egehegnet 2 C | 2 C | Egehegnet | | |
| Mariendalsvej 50 F,4 tv | 50 F | Mariendalsvej | tv | |
| Dronninggårds Alle 138 | 138 | Dronninggårds | Alle | |

 

For vejnavn1, vejnavn2 and vejnavn3 I used this expression (tokenize): \<\u\l+\>

 

The output I would like to get is this:

 

| adresse1 | Husnr1 | vejnavn1 | vejnavn2 | vejnavn3 |
|---|---|---|---|---|
| Egehegnet 2 C | 2 C | Egehegnet | | |
| Mariendalsvej 50 F,4 tv | 50 F | Mariendalsvej | | |
| Dronninggårds Alle 138 | 138 | Dronninggårds | Alle | |

 

I want everything after the comma (e.g. "4 tv") to be removed.

 

Thanks for your help in advance.

 

Regards, Trine

1 REPLY
PhilipMannering
16 - Nebula

To remove everything after the comma, you could split the address into columns on the comma delimiter (for example with the Text To Columns tool) and keep only the first column.
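
For anyone who wants to see the idea outside Alteryx, here is a minimal Python sketch of the same split. It only illustrates the logic, not the Alteryx tool itself; the sample addresses are taken from the question, and the variable names are made up for the example.

```python
# Sample addresses copied from the question.
addresses = [
    "Egehegnet 2 C",
    "Mariendalsvej 50 F,4 tv",
    "Dronninggårds Alle 138",
]

# Split on the first comma only and keep the part before it.
# Addresses without a comma are returned unchanged.
trimmed = [address.split(",", 1)[0].strip() for address in addresses]

print(trimmed)
# ['Egehegnet 2 C', 'Mariendalsvej 50 F', 'Dronninggårds Alle 138']
```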

 

I think your regular expression is correct. Just untick the Case Insensitive box. This ensures that you only tokenize words starting with a capital letter, so a lowercase word such as "tv" is no longer picked up.
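
To show the effect of the Case Insensitive setting, here is a rough Python sketch. Python's re module does not support \u, \l, \< or \>, so the pattern below is only an approximate translation using explicit character classes (adding the Danish letters æ, ø and å is my assumption), and the tokenize helper is hypothetical.

```python
import re

# Approximate equivalent of \<\u\l+\> : a whole word made of one
# uppercase letter followed by one or more lowercase letters.
# The Danish letters are listed explicitly (an assumption for this sketch).
WORD_PATTERN = r"\b[A-ZÆØÅ][a-zæøå]+\b"


def tokenize(text, case_insensitive=False):
    """Return every word in text that matches WORD_PATTERN."""
    flags = re.IGNORECASE if case_insensitive else 0
    return re.findall(WORD_PATTERN, text, flags)


# With case insensitivity on, the lowercase word "tv" also matches.
print(tokenize("Mariendalsvej 50 F,4 tv", case_insensitive=True))
# ['Mariendalsvej', 'tv']

# With case sensitivity, only capitalised words are returned.
print(tokenize("Mariendalsvej 50 F,4 tv"))
# ['Mariendalsvej']
print(tokenize("Dronninggårds Alle 138"))
# ['Dronninggårds', 'Alle']
```

Single letters such as "C" or "F" never match in either mode, because the pattern requires at least one lowercase letter after the first character.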
