Bring your best ideas to the AI Use Case Contest! Enter to win 40 hours of expert engineering support and bring your vision to life using the powerful combination of Alteryx + AI. Learn more now, or go straight to the submission form.
Start Free Trial

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

K-Centroids Clustering - Number of starting seeds

ashi_s
5 - Atom

Hi, 

 

I am actually trying to create a few clusters with my data of about 400 observations, and am a little confused about the role played by the starting number of seeds. 

 

Irrespective of changing my number of starting seeds I end up with the exact same observations in each cluster. Which is why I was trying to understand if starting the analysis with a different value for the number of seeds can actually change my final clusters. And if yes, when does it happen ? 

 

Thanks

1 REPLY 1
jdunkerley79
ACE Emeritus
ACE Emeritus

The K-Means algorithm can converge to local minimums depedning on the input points and the starting seed. Having different starting seeds and then picking the `best` K-mean solution is the standard way to work around this. For you specific dataset it might be that it always converges to the same global minimum.

 

A more detailed post: https://stats.stackexchange.com/a/50316

Labels
Top Solution Authors