Alteryx Designer Desktop Discussions

KmbrlyPC · ‎04-27-2021

I have some messy data in Excel they I am needing to separate out into columns before I can even do anything with the data. Based on my current knowledge of Alteryx, the RegEx tool would probably be a good way to parse out the data, however I am new to this tool and have trouble with arriving at the correct expressions. This would be a huge win because it would save me a ton of time not having to parse this all out using Excel since the actual data is hundreds of lines. Below is a screenshot of some mock data and the outcome I am needing. An Excel spreadsheet is also attached. Thank you in advance!

apathetichell · ‎04-27-2021

try this version as a start.

KmbrlyPC · ‎04-27-2021

Thank you. This is helpful. One thing I came across is the Rule Code may be more than just 1 listed. Sometimes it's 1, but other times it's 3 or 4. For example "Trees, Grass, Shrubs (1-234.56, 4-321.56)". How would I update the RegEx formula to capture this? This would be the 4th tool in the workflow you created.

apathetichell · ‎04-27-2021

It would probably be easiest to do a text to columns split to rows (comma delimited) after you have most of the data set up. this would create two entries with the rest of the data static. I'd probably do the split after crosstab - as otherwise it'll re concatenate on crosstab assuming the column header is the same.

KmbrlyPC · ‎04-27-2021

Could you provide an example of the configuration for the text to columns tool? Attached is the updated mock data that reflects the multiple rule codes Trees, Grass, Shrubs (1-234.56, 4-321.56). Thanks in advance.

KmbrlyPC · ‎04-27-2021

Although, if it is possible to get the RegEx formula to work to recognize the multiple Rule Codes then that would be all I need. Everything else is working great to get the data I need. It's just this one piece.

Current formula below recognizes "Trees, Grass, Shrubs (1-234.56)"

if regex_match([Data],".+\(\d+-\d+\.\d+\)") then [Data] else [Row-1:Title (Sub-Category Description)] endif

But I am needing it to be more holistic to recognize this as well "Trees, Grass, Shrubs (1-234.56, 4-321.56)"

apathetichell · ‎04-27-2021

Try this one...

Alteryx Designer Desktop Discussions

Regex Help

Re: Is there any way the computer vision tools can...

Re: Batch Macro

Re: How to get cell reference address from excel

Re: Replacing Forecast columns with Actual Data

Re: Row creation