Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Remove Top and Bottom 5% of Records

Brian_H
7 - Meteor

Hello

 

I have a dataset of about 60k lines with only 3 columns of data. They data has outliers on both the high and low end of the data. Therefore, I would like to strip out the highest 5% and lowest 5% based on a specific column that is numeric. I am struggling on figuring out how to do this and am looking for any recommendations.

 

Thanks

 

Brian 

5 REPLIES 5
Joe_Mako
12 - Quasar

Are you looking for something like the attached?

 

top bottom.png

 

- Sort the column of interest

- Record ID to get an incrementing number per record

- Summary to get a total count of records

- Append to have the count on each record

- Filter to remove the top and bottom 5 percent

 

Brian_H
7 - Meteor

Hello Joe,

 

This is exactly what I needed. Thank you!!!

 

Thanks

 

Brian

 

cavano
5 - Atom

Is there a way to achieve the same result but for In-Database?

BenMoss
ACE Emeritus
ACE Emeritus
There is an in-db sample tool. With which you can specify the % or records to return. You could use this twice kn your data stream to get the top 5% and then bottom 5% and then union the results together.

Ben
BenMoss
ACE Emeritus
ACE Emeritus

Alternatively, you could use a SQL statement on input to first sort and then give an incrimental ID to your data; once you have this you have all the other tools in-db that will allow you to create your solution.

 

Link to in-DB append which is not standard in the package: https://gallery.alteryx.com/#!app/Append-In-DB/5ab3a06f826fd310ecda0b47

 

Ben

Labels