Hi Alteryx Community,
I am trying to do Challenge #300 of the Alteryx Challenge (Challenge #300: Where is the party? - Alteryx Community)
I did not know how to parse text so I looked at the solution, but I am struggling to understand the RegEx parse analysis.
Country ISO codes and country names are put together in a string (Screenshot 1).
To separate the first 2 capital letters from the rest of the text the solution uses the following RegEx code: (\u{2})(.*)
Could someone help me understand what does \u mean?
Also, I found a lot RegEx documentation online and apparently there are different types of RegEx. Is Alteryx's RegEx python- or java-based? Where can I find documentation to learn more about RegEx for Alteryx?
Thank you!
Screenshot 1:
Solved! Go to Solution.
\u means the uppercase letter.
(\u{2}) - this would give you two Uppercase letters in that column.
hi @selsset97
I would recommend the tooll mastery series here is the REg ex tool mastery article:
https://community.alteryx.com/t5/Alteryx-Designer-Knowledge-Base/Tool-Mastery-RegEx/ta-p/37689
\u stands for up[[[er case letter
In order to pull out the first two characters you could alwys use Left function inthe formula tool for a less complicated approach.
Hi @selsset97
The \u simply means Upper case character. On the Regex Parse tool the + sign has a drop down which provides details on the basic regex functionality available which can help.
A lot of people recommend using https://regex101.com/ to help as well while learning and testing your regex expressions as well.
Thank you for your reply!
May I also ask you which of the following "flavors" is Alteryx based on in the regex101: build, test, and debug regex website?
E.g., PCRE2, PCRE, ECMAScript, Python, Golang, etc.
Alteryx uses Perl Regular Expression Syntax, so PCRE2 should work fine (at least it's what i've always left it on when i needed to use the website :) )
A caution to you and future readers: Alteryx defaults to case-insensitive use of RegEx. If you are using a RegEx formula, you need to include a icase parameter of 0. If using the RegEx tool, make sure that the case insensitivity is off.
REGEX_Replace
REGEX_Replace(String, pattern, replace, icase): Allows replacement of text using regular expressions and returns the string resulting from the RegEx find (pattern) and replace (string). All occurrences of the match are replaced, not just the first.
The replace parameter can be either a specified value as shown below or a marked group, like "$1". Make sure that marked groups are wrapped in double quotes.
icase is an optional parameter. When specified, the case must match.
You want to avoid matching xx when you wanted to only match XX.
Cheers,
Mark