Hi All,
I am trying to calculate the median and 98th percentile for large data set with million of records, and the calculation of median and percentile is not as expected, it messing up somehow. When trying the same calculation using smaller data set it works perfectly fine.
Hi Krishna,
Yes I am using summarize tool only to calculate the median and percentile but while working with large data set(100millions) the calculation was messing up.
To overcome such problem used recordid tool so it calculating as expected.
Thank you.
I’ve recently encountered a significant challenge with my workflow, specifically when working with large datasets (exceeding 10 million entries) and calculating medians. This issue has been quite impactful, and I’m earnestly seeking immediate assistance to address it. In my efforts to troubleshoot, I found that Python-based results are consistent with those derived from the Record tool and sorting methods. The precision of these median calculations is paramount, as they substantially influence my strategic decision-making and analysis. Could you please prioritize addressing this matter?