
48 research outputs found

Fully Dynamic Consistent Facility Location

Author: Chris Schwiegelshohn
David Saulpic
Niklas Oskar D. Hjuler
Nikos Parotsidis
Vincent Cohen-Addad
Publication venue: H. Wallach and H. Larochelle and A. Beygelzimer and F. d'Alché-Buc and E. Fox and R. Garnett
Publication date: 01/01/2019

We consider classic clustering problems in fully dynamic data streams, where data elements can be both inserted and deleted. In this context, several parameters are of importance: (1) the quality of the solution after each insertion or deletion, (2) the time it takes to update the solution, and (3) how different consecutive solutions are. The question of obtaining efficient algorithms in this context for facility location, k-median and k-means was raised in a recent paper by Hubert-Chan et al. [WWW'18] and also appears as a natural follow-up to the online model with recourse studied by Lattanzi and Vassilvitskii [ICML'17] (i.e., in insertion-only streams). In this paper, we focus on general metric spaces and mainly on the facility location problem. We give an arguably simple algorithm that maintains a constant-factor approximation, with O(n log n) update time and O(n) total recourse. This improves over the naive algorithm, which recomputes a solution at each time step and can take up to O(n^2) update time and O(n^2) total recourse. These bounds are nearly optimal: in a general metric space, inserting a point takes O(n) time just to describe its distances to the other points, and we give a simple lower bound of Ω(n) for the recourse. Moreover, we generalize this result to the k-median and k-means problems: our algorithm maintains a constant-factor approximation in time Õ(n+k^2). We complement our analysis with experiments showing that the cost of the solution maintained by our algorithm at any time t is very close to the cost of a solution obtained by quickly recomputing a solution from scratch at time t, while having a much better running time.
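To make the quantities in this abstract concrete, here is a minimal Python sketch of the facility location objective and of one possible way to count recourse between two consecutive solutions. It is not the authors' algorithm: the function names, the uniform `opening_cost` parameter, the Euclidean distance, and the symmetric-difference definition of recourse are all assumptions chosen for illustration.

```python
from math import dist  # Euclidean distance between two coordinate tuples (Python 3.8+)


def facility_location_cost(points, facilities, opening_cost):
    """Opening cost per open facility plus each point's distance to its nearest facility.

    Assumes a uniform opening cost and Euclidean distances (illustrative choices).
    """
    if not facilities:
        raise ValueError("at least one facility must be open")
    connection = sum(min(dist(p, f) for f in facilities) for p in points)
    return opening_cost * len(facilities) + connection


def recourse(old_facilities, new_facilities):
    """Count facilities opened or closed between two consecutive solutions.

    Uses the size of the symmetric difference (an assumed measure of change).
    """
    return len(set(old_facilities) ^ set(new_facilities))
```

For example, with points `(0, 0)`, `(1, 0)` and `(4, 0)` and an opening cost of 2, opening a second facility at `(4, 0)` lowers the total cost while incurring a recourse of 1 under this definition.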

HAL Descartes

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Archivio della ricerca- Università di Roma La Sapienza

Hal-Diderot

Polynomial Fitting of Data Streams with Applications to Codeword Testing

Author: McGregor Andrew
Rudra Atri
Uurtamo Steve
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 28th International Symposium on Theoretical Aspects of Computer Science (STACS 2011)
Publication date: 01/01/2011

Given a stream of (x,y) points, we consider the problem of finding univariate polynomials that best fit the data. Over finite fields, this problem encompasses the well-studied problem of decoding Reed-Solomon codes while over the reals it corresponds to the well-studied polynomial regression problem. We present one-pass algorithms for two natural problems: i) find the polynomial of a given degree k that minimizes the error and ii) find the polynomial of smallest degree that interpolates through the points with at most a given error bound. We consider a range of error models including the average error per point, the maximum error, and the number of points that are not fitted exactly. Many of our results apply to both the reals and finite fields. As a consequence we also solve an open question regarding the tolerant testing of codes in the data stream model.
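As an illustration of the one-pass setting over the reals, the sketch below fits a degree-k polynomial by least squares while storing only O(k) running sums. It is a textbook normal-equations approach, not the algorithm from the paper, and the class name `StreamingPolyFit` and its methods are hypothetical.

```python
class StreamingPolyFit:
    """One-pass least-squares fit of a degree-k polynomial over the reals.

    Illustrative only: accumulates power sums, then solves the normal equations.
    """

    def __init__(self, k):
        self.k = k
        # S[j] = sum of x**j for j = 0..2k; T[j] = sum of y * x**j for j = 0..k.
        self.S = [0.0] * (2 * k + 1)
        self.T = [0.0] * (k + 1)

    def update(self, x, y):
        """Process one (x, y) point; memory stays O(k) regardless of stream length."""
        p = 1.0
        for j in range(2 * self.k + 1):
            self.S[j] += p
            if j <= self.k:
                self.T[j] += y * p
            p *= x

    def coefficients(self):
        """Solve the normal equations and return [c0, c1, ..., ck]."""
        n = self.k + 1
        # Augmented matrix [A | b] with A[i][j] = S[i + j] and b[i] = T[i].
        M = [[self.S[i + j] for j in range(n)] + [self.T[i]] for i in range(n)]
        for col in range(n):
            # Partial pivoting for numerical stability.
            pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
            if abs(M[pivot][col]) < 1e-12:
                raise ValueError("singular system: need more distinct x values")
            M[col], M[pivot] = M[pivot], M[col]
            # Gauss-Jordan step: clear this column in every other row.
            for r in range(n):
                if r != col:
                    factor = M[r][col] / M[col][col]
                    M[r] = [a - factor * b for a, b in zip(M[r], M[col])]
        return [M[i][n] / M[i][i] for i in range(n)]
```

Feeding it points drawn from a quadratic and calling `coefficients()` on a degree-2 fitter recovers the coefficients up to floating-point error. The power-sum matrix becomes ill-conditioned for large k or widely spread x values, so this sketch suits small degrees only.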

CiteSeerX

Dagstuhl Research Online Publication Server

Fully Dynamic Consistent Facility Location

Author: Cohen-Addad Vincent
Hjuler Niklas Oskar D.
Parotsidis Nikos
Saulpic David
SCHWIEGELSHOHN CHRIS RENE
Publication venue: H. Wallach and H. Larochelle and A. Beygelzimer and F. d'Alché-Buc and E. Fox and R. Garnett
Publication date: 01/01/2019

Archivio della ricerca- Università di Roma La Sapienza