Skip to main content
All CollectionsFAQModeling FAQClustering FAQ
How does G2M optimize the number of clusters when clustering a dataset?
How does G2M optimize the number of clusters when clustering a dataset?

This article discusses how the G2M platform optimizes the number of clusters when clustering a dataset

Updated over 5 months ago

When optimizing the number of clusters for a dataset, the G2M platform will pick the number of clusters associated with the highest Silhouette score. Note that the Silhouette score is most relevant when dealing with well-behaved, i.e. convex-shaped, clusters. In many real-life cases clusters are not convex and the Silhouette score may no longer be relevant. In this case you are best served relying on your domain expertise to identify the most relevant number of clusters.

Did this answer your question?