In case you missed the announcement: Alteryx One is here, and so is the Spring Release! Learn more about these new and exciting releases here!

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Data Profiling - Pattern Analysis

froseph
6 - Meteoroid

Hi,

 

One feature which would be of significant value for Data Profiling is Pattern Analysis, with a view of pattern frequencies. This is available in many Data Profiling tools, including the free Talend Open Studio:

 

froseph_0-1611087452496.png

 

Is this something which can be easily produced in Alteryx, or is this a feature on the Data Profiling functionality roadmap?

 

Cheers!

 

7 REPLIES 7
Emil_Kos
17 - Castor
17 - Castor

Hi @froseph,


If you want this idea to get implemented in the Alteryx Designer you should post it in this place.

 

https://community.alteryx.com/t5/Alteryx-Designer-Ideas/idb-p/product-ideas

 

If there will be a lot of positive feedback it should get implemented sooner or later 🙂 

smoskowitz
12 - Quasar

Hi @froseph 

 

If you add a browse tool to the end of your data and then click it after a run, the data is profiled in the configuration section. Were you aware of this or am I mistaken as to your ask?

 

Thanks,

Seth

froseph
6 - Meteoroid

Thanks Seth - This shows frequency tables, but not frequency tables of common (and uncommon) patterns - Unless I am missing something?

smoskowitz
12 - Quasar

You could also try the Field Summary tool in the Data Investigation tab. That might give you further detail. There is also a Frequency Table tool, but you might need to download the extra tools. 

 

Seth

smoskowitz
12 - Quasar

This will give you a few extra tools in your data investigation tab that you might find of use.

 

Seth

 

smoskowitz_0-1611087974821.png

 

froseph
6 - Meteoroid

Not quite what I'm looking for - in essence the tool should recognize common patterns in string fields, translate these into a kind of pseudo-regex and then produce a frequency table of those patterns as opposed to the values, a la:

 

Patterns - this is what we want:

froseph_0-1611088015321.png

 

versus Values - this is not what we want:

 

froseph_1-1611088054914.png

 

Note that screenshots are from Talend Open Studio - not Alteryx. 

 

 

smoskowitz
12 - Quasar

Another option is the summary tool. Just Groupby the Value use a Count Records tool to get a total amount of records. The append the Count Records to each record and do the math.

 

Based on what you are showing, it looks easy enough to create. I dropped in a quick mock up for you.

 

I Hope this helps.

 

Seth

Labels
Top Solution Authors