Clustering-Using-KMeans

K-means is an unsupervised clustering algorithm that tries to partition a set of points into k clusters. It is used when you have to group a collection of stuff into various clusters.

The Algorithm:

Assign random positions for k centroids
Compute the distance of each point from the centroids and assign each point to its nearest centroid, thereby forming k clusters
Take the mean of the distance of the points assigned to each centroid. This now becomes the positions of the new centroids
Now check the error(distance) between the positions of old and new centroids.
If the error is not equal to 0, repeat steps 3 and 4. If the positions of the old and new centroids match, then the required clusters are formed!

This gif might help you better understand the algorithm

However, using Python's Scikit to perform KMeans is much simpler The outputs can be found here.

Some practical applications:

Pricing Segmentation
Customer Need Segmentation
Loyalty Segmentation
Where do millionaires live?
Create stereotypes from demographics data

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Dataset		Dataset
Output		Output
.gitignore		.gitignore
KMeans.gif		KMeans.gif
KMeans.py		KMeans.py
KMeansUsingScikit.py		KMeansUsingScikit.py
KMeansWithoutComments.py		KMeansWithoutComments.py
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clustering-Using-KMeans

The Algorithm:

Some practical applications:

About

Releases

Packages

Languages

Surya-Murali/Clustering-Using-KMeans

Folders and files

Latest commit

History

Repository files navigation

Clustering-Using-KMeans

The Algorithm:

Some practical applications:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages