Sequential Learning of Principal Curves: Summarizing Data Streams on the
  Fly

Guedj, Benjamin; Li, Le

slides

Sequential Learning of Principal Curves: Summarizing Data Streams on the Fly

Authors: Benjamin Guedj
Le Li
Publication date: 8 May 2019
Publisher

Abstract

When confronted with massive data streams, summarizing data with dimension reduction methods such as PCA raises theoretical and algorithmic pitfalls. Principal curves act as a nonlinear generalization of PCA and the present paper proposes a novel algorithm to automatically and sequentially learn principal curves from data streams. We show that our procedure is supported by regret bounds with optimal sublinear remainder terms. A greedy local search implementation (called \texttt{slpc}, for Sequential Learning Principal Curves) that incorporates both sleeping experts and multi-armed bandit ingredients is presented, along with its regret computation and performance on synthetic and real-life data

Similar works

Full text

Available Versions

Archive Ouverte en Sciences de l'Information et de la Communication

oai:HAL:hal-01796011v2

Last time updated on 09/07/2019