Alteryx Designer Desktop Discussions

Smiller100 · ‎08-18-2021

Hello Alteryx Community,

I am still fairly new to Alteryx, and Regex by extension, and I have a column that I am trying to parse out where I want to capture everything separately, and it may repeat once or more, while excluding certain characters. I'll include examples below:

01234 · Word Word

01234 · Word Word:01234 · Word Word

01234 · Word Word:01234 · Word Word:01234 · Word word

01234 · Word Word:01234 · Word Word:01234 · Word word:01234 · Word word

(Please note the "·" and ":" in the data)

My current expression is as follows: ((?:[^·:])+)

This expression matches everything separately and repeats itself, but it places all the matches in Group 1 instead of Group 0, which Alteryx does not seem to like. Any ideas on how I can change this so that everything can be in Group 0? Or some help with writing a different expression would be much appreciated.

Edit: I updated the expression to match everything to Group 0: (?:[^·:])+

However, Alteryx returns an error saying there is Nothing to Parse. I'm getting closer but still just need help getting Alteryx to recognize this.

john_watkins · ‎08-18-2021

So is the . or : a delimiter for what represents a word or just any space? It would be good to know which parts of the sample rows you consider a match above.

Smiller100 · ‎08-18-2021

I did end up figuring it out, I just changed the output method from Parse to Tokenize and it captured everything. Thank you though

Alteryx Designer Desktop Discussions

Regex - How to capture everything but certain characters, repeated once or more