Median and Percentile Calculation in Alteryx Doesn't Work on Large Data Sets
Hi All,
I am trying to calculate the median and 98th percentile for a large data set with millions of records, and the results are not as expected. When I run the same calculation on a smaller data set, it works perfectly fine.
Hi Krishna,
Yes, I am using only the Summarize tool to calculate the median and percentile, but with a large data set (100 million records) the calculation was coming out wrong.
To work around the problem I used the Record ID tool, and the values are now calculated as expected.
Thank you.
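For anyone who wants to reproduce the sort-and-Record-ID idea outside Alteryx, here is a minimal Python sketch. It is not the Summarize tool's algorithm. It sorts the values, treats the 1-based position as the record ID, and picks rows by position. The function name `median_and_percentile` is made up for illustration, and the percentile uses the nearest-rank definition, which is an assumption: percentile definitions differ between tools, so small differences from other results are possible.

```python
import math

def median_and_percentile(values, pct=98):
    """Sort the values, number them 1..n, and pick records by position."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("no records")

    # Median: the middle record, or the mean of the two middle records.
    mid = n // 2
    if n % 2 == 1:
        median = ordered[mid]
    else:
        median = (ordered[mid - 1] + ordered[mid]) / 2

    # Nearest-rank percentile (assumed definition): the smallest record ID
    # whose position covers at least pct% of the rows.
    rank = max(1, math.ceil(pct * n / 100))
    return median, ordered[rank - 1]
```

Calling `median_and_percentile(my_column)` returns the median and the 98th percentile as a tuple, where `my_column` stands for whatever list of numbers you exported from the workflow.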
I recently ran into a significant problem in my workflow when calculating medians on large datasets (more than 10 million entries). While troubleshooting, I found that Python-based results are consistent with those from the Record tool and sorting methods. The precision of these medians matters a great deal, because they feed directly into my analysis and decision-making. Could you please prioritize this issue?
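In case it helps others cross-check a suspicious result, below is a rough Python sketch of a one-pass sanity check that needs no sorting. A value can only be a median if no more than half the records are strictly below it and no more than half are strictly above it. The function name `is_valid_median` and the `tol` parameter are my own, not part of any Alteryx or Python library. For an even record count it accepts any value between the two middle records, including their mean.

```python
def is_valid_median(values, candidate, tol=0.0):
    """Return True if `candidate` is a plausible median of `values`."""
    n = below = above = 0
    for v in values:
        n += 1
        if v < candidate - tol:
            below += 1      # record is strictly below the candidate
        elif v > candidate + tol:
            above += 1      # record is strictly above the candidate
    # A median has at most half the records on either side of it.
    return n > 0 and 2 * below <= n and 2 * above <= n
```

Feeding the exported column and the median reported by a tool into this function gives a quick yes/no answer, which is cheaper than a full sort on tens of millions of rows.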
