What is Clustering?
K-Means Clustering is an unsupervised learning algorithm that is used to solve clustering problems in machine learning or data science. In this topic, we will learn the K-means clustering algorithm, how the algorithm works, and the Python implementation of k-means clustering.
It is an iterative algorithm that divides the unlabeled dataset into k different clusters in such a way that each dataset belongs to only one group that has similar properties
How does the K-Means Algorithm Work?
Use Case in the security domain:
Identifying crime localities:
With data related to crimes available in specific localities in a city, the category of crime, the area of the crime, and the association between the two can give quality insight into crime-prone areas within a city or a locality. here is a sample implementation of the k-means for document clustering.
Call record detail analysis
Recommended by LinkedIn
Automatic clustering of it alerts
Delivery store optimization
Drawbacks
Kmeans algorithm is good in capturing the structure of the data if clusters have a spherical-like shape. It always tries to construct a nice spherical shape around the centroid. That means that the minute the clusters have complicated geometric shapes, k-means do a poor job in clustering the data.
Conclusion
K-means algorithm is useful for undirected knowledge discovery and is relatively simple. K-means has found widespread usage in a lot of fields, ranging from unsupervised learning of neural networks, Pattern recognitions, Classification analysis, Artificial intelligence, image processing, machine vision, and many others.
Thank You For Reading