-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
for the same set of data, the centroids vary for new run #1
Comments
I'm also seeing this problem. Appears to generate new centroids on every run of identical data |
Is there a lot of local minima in your data set? |
Yes. This is pixel RGB color data from an image |
Yes, that's linked to your problem.
|
Any easy fix? |
This is one todo. I think that can be solved with different solutions:
I'm happy if you create a Pull Request with a solution. Thanks, |
Appreciate the time. I'll try to look into it after next week if i have some time! |
One of the solution used in
Note about inertia (from sklearn): Sum of squared distances of samples to their closest cluster center. @Philmod I can do a PR |
For each new run of node-kmeans on the same set of data, the clusters and centroids vary. Is there any way we can fix the skewed results or probably start with a constant seed.
The text was updated successfully, but these errors were encountered: