High Dimensional Clustering with $r$-nets

Authors: Avarikioti Georgia, Ryser Alain, Wang Yuyi, Wattenhofer Roger
Publication date: 06/11/2018

Clustering, a fundamental task in data science and machine learning, groups a set of objects in such a way that objects in the same cluster are closer to each other than to those in other clusters. In this paper, we consider a well-known structure, so-called $r$-nets, which rigorously captures the properties of clustering. We devise algorithms that improve the run-time of approximating $r$-nets in high-dimensional spaces with $\ell_1$ and $\ell_2$ metrics from $\tilde{O}(dn^{2-\Theta(\sqrt{\epsilon})})$ to $\tilde{O}(dn + n^{2-\alpha})$, where $\alpha = \Omega(\epsilon^{1/3}/\log(1/\epsilon))$. These algorithms are also used to improve a framework that provides approximate solutions to other high dimensional distance problems. Using this framework, several important related problems can also be solved efficiently, e.g.,