2000 character limit reached
Fixed-sized clusters $k$-Means
Published 27 Jan 2025 in cs.LG | (2501.16113v1)
Abstract: We present a $k$-means-based clustering algorithm, which optimizes the mean square error, for given cluster sizes. A straightforward application is balanced clustering, where the sizes of each cluster are equal. In the $k$-means assignment phase, the algorithm solves an assignment problem using the Hungarian algorithm. This makes the assignment phase time complexity $O(n3)$. This enables clustering of datasets of size more than 5000 points.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.