Search CORE

901 research outputs found

Cooperative Online Learning: Keeping your Neighbors Updated

Author: Cesa-Bianchi Nicolò
Cesari Tommaso R.
Monteleoni Claire
Publication venue
Publication date: 01/01/2020
Field of study

We study an asynchronous online learning setting with a network of agents. At each time step, some of the agents are activated, requested to make a prediction, and pay the corresponding loss. The loss function is then revealed to these agents and also to their neighbors in the network. Our results characterize how much knowing the network structure affects the regret as a function of the model of agent activations. When activations are stochastic, the optimal regret (up to constant factors) is shown to be of order

\sqrt{\alpha T}

, where

T

is the horizon and

\alpha

is the independence number of the network. We prove that the upper bound is achieved even when agents have no information about the network structure. When activations are adversarial the situation changes dramatically: if agents ignore the network structure, a

\Omega(T)

lower bound on the regret can be proven, showing that learning is impossible. However, when agents can choose to ignore some of their neighbors based on the knowledge of the network structure, we prove a

O(\sqrt{\overline{\chi} T})

sublinear regret bound, where

\overline{\chi} \ge \alpha

is the clique-covering number of the network

arXiv.org e-Print Archive

AIR Universita degli studi di Milano

Handling Delayed Feedback in Distributed Online Optimization : A Projection-Free Approach

Author: Nguyen Tuan-Anh
Thang Nguyen Kim
Trystram Denis
Publication venue
Publication date: 03/02/2024
Field of study

Learning at the edges has become increasingly important as large quantities of data are continually generated locally. Among others, this paradigm requires algorithms that are simple (so that they can be executed by local devices), robust (again uncertainty as data are continually generated), and reliable in a distributed manner under network issues, especially delays. In this study, we investigate the problem of online convex optimization under adversarial delayed feedback. We propose two projection-free algorithms for centralised and distributed settings in which they are carefully designed to achieve a regret bound of O(\sqrt{B}) where B is the sum of delay, which is optimal for the OCO problem in the delay setting while still being projection-free. We provide an extensive theoretical study and experimentally validate the performance of our algorithms by comparing them with existing ones on real-world problems

arXiv.org e-Print Archive

Learning and Management for Internet-of-Things: Accounting for Adaptivity and Scalability

Author: Barbarossa Sergio
Chen Tianyi
Giannakis Georgios B.
Wang Xin
Zhang Zhi-Li
Publication venue
Publication date: 27/10/2018
Field of study

Internet-of-Things (IoT) envisions an intelligent infrastructure of networked smart devices offering task-specific monitoring and control services. The unique features of IoT include extreme heterogeneity, massive number of devices, and unpredictable dynamics partially due to human interaction. These call for foundational innovations in network design and management. Ideally, it should allow efficient adaptation to changing environments, and low-cost implementation scalable to massive number of devices, subject to stringent latency constraints. To this end, the overarching goal of this paper is to outline a unified framework for online learning and management policies in IoT through joint advances in communication, networking, learning, and optimization. From the network architecture vantage point, the unified framework leverages a promising fog architecture that enables smart devices to have proximity access to cloud functionalities at the network edge, along the cloud-to-things continuum. From the algorithmic perspective, key innovations target online approaches adaptive to different degrees of nonstationarity in IoT dynamics, and their scalable model-free implementation under limited feedback that motivates blind or bandit approaches. The proposed framework aspires to offer a stepping stone that leads to systematic designs and analysis of task-specific learning and management schemes for IoT, along with a host of new research directions to build on.Comment: Submitted on June 15 to Proceeding of IEEE Special Issue on Adaptive and Scalable Communication Network

arXiv.org e-Print Archive

Archivio della ricerca- Università di Roma La Sapienza