K-means++ is an important algorithm to choose initial cluster centers for the
k-means clustering algorithm. In this work, we present a new algorithm that can
solve the k-means++ problem with near optimal running time. Given n data
points in Rd, the current state-of-the-art algorithm runs in
O(k) iterations, and each iteration takes O(ndk)
time. The overall running time is thus O(ndk2). We propose a
new algorithm \textsc{FastKmeans++} that only takes in O(nd+nk2) time, in total