A Faster $k$-means++ Algorithm

Liang, Jiehao; Sarkhel, Somdeb; Song, Zhao; Yin, Chenbo; Zhuo, Danyang

A Faster $k$ -means++ Algorithm

Authors: Jiehao Liang
Somdeb Sarkhel
Zhao Song
Chenbo Yin
Danyang Zhuo
Publication date: 28 November 2022
Publisher

Abstract

K-means++ is an important algorithm to choose initial cluster centers for the k-means clustering algorithm. In this work, we present a new algorithm that can solve the

k

-means++ problem with near optimal running time. Given

n

data points in

\mathbb{R}^d

, the current state-of-the-art algorithm runs in

\widetilde{O}(k )

iterations, and each iteration takes

\widetilde{O}(nd k)

time. The overall running time is thus

\widetilde{O}(n d k^2)

. We propose a new algorithm \textsc{FastKmeans++} that only takes in

\widetilde{O}(nd + nk^2)

time, in total

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2211.15118

Last time updated on 30/12/2022