Search CORE

850,735 research outputs found

Streaming, Distributed Variational Inference for Bayesian Nonparametrics

Author: Campbell Trevor
Fisher III John W.
How Jonathan P.
Straub Julian
Publication venue
Publication date: 30/10/2015
Field of study

This paper presents a methodology for creating streaming, distributed inference algorithms for Bayesian nonparametric (BNP) models. In the proposed framework, processing nodes receive a sequence of data minibatches, compute a variational posterior for each, and make asynchronous streaming updates to a central model. In contrast to previous algorithms, the proposed framework is truly streaming, distributed, asynchronous, learning-rate-free, and truncation-free. The key challenge in developing the framework, arising from the fact that BNP models do not impose an inherent ordering on their components, is finding the correspondence between minibatch and central BNP posterior components before performing each update. To address this, the paper develops a combinatorial optimization problem over component correspondences, and provides an efficient solution technique. The paper concludes with an application of the methodology to the DP mixture model, with experimental results demonstrating its practical scalability and performance.Comment: This paper was presented at NIPS 2015. Please use the following BibTeX citation: @inproceedings{Campbell15_NIPS, Author = {Trevor Campbell and Julian Straub and John W. {Fisher III} and Jonathan P. How}, Title = {Streaming, Distributed Variational Inference for Bayesian Nonparametrics}, Booktitle = {Advances in Neural Information Processing Systems (NIPS)}, Year = {2015}

arXiv.org e-Print Archive

DSpace@MIT

Recommended from our members

Distributed mentoring: peer interaction and collaborative learning in P2PU

Author: Alevizou Panagiota
Publication venue
Publication date: 01/11/2010
Field of study

This paper explores how learning design and peer behaviour develops and evolves in a free, open online learning community, the Peer-to-peer university (P2PU). Drawing on ideas relating to 'participatory learning' (Seely-Brown and Adler, 2008), it begins with a theoretical discussion of the ways in which the infrastructural and the social dimensions of peer learning are expressed in terms of the design of three courses, and in relation to mentoring and peer interaction. Evident from the textual interface and social organization of the three courses is that the role of the instructor or course organizer adheres a cooperative model (Burge, 1994), reflected in the aggregation and filtering of materials and the evolution of pedagogical modeling. While the models of participation and engagement vary, depending on socio-technical factors, evident is that the governance model allows both for light models of involvement and the evolution of inquiry towards what we would like to call 'distributed' mentoring. We conclude with an evaluation of the ways in which the courses under study promote a participatory infrastructure, that not only can make the process of learning transparent, but also represent a relationship between teaching and learning in an open fashion

Open Research Online (The Open University)

Client Selection for Federated Bayesian Learning

Author: Kassab Rahif
Liu Yuan
Yang Jiarong
Publication venue
Publication date: 11/12/2022
Field of study

Distributed Stein Variational Gradient Descent (DSVGD) is a non-parametric distributed learning framework for federated Bayesian learning, where multiple clients jointly train a machine learning model by communicating a number of non-random and interacting particles with the server. Since communication resources are limited, selecting the clients with most informative local learning updates can improve the model convergence and communication efficiency. In this paper, we propose two selection schemes for DSVGD based on Kernelized Stein Discrepancy (KSD) and Hilbert Inner Product (HIP). We derive the upper bound on the decrease of the global free energy per iteration for both schemes, which is then minimized to speed up the model convergence. We evaluate and compare our schemes with conventional schemes in terms of model accuracy, convergence speed, and stability using various learning tasks and datasets

arXiv.org e-Print Archive

Byzantine-Resilient Learning Beyond Gradients: Distributing Evolutionary Search

Author: Dolamic Ljiljana
Guerraoui Rachid
Kucharavy Andrei
Monti Matteo
Publication venue
Publication date: 20/04/2023
Field of study

Modern machine learning (ML) models are capable of impressive performances. However, their prowess is not due only to the improvements in their architecture and training algorithms but also to a drastic increase in computational power used to train them. Such a drastic increase led to a growing interest in distributed ML, which in turn made worker failures and adversarial attacks an increasingly pressing concern. While distributed byzantine resilient algorithms have been proposed in a differentiable setting, none exist in a gradient-free setting. The goal of this work is to address this shortcoming. For that, we introduce a more general definition of byzantine-resilience in ML - the \textit{model-consensus}, that extends the definition of the classical distributed consensus. We then leverage this definition to show that a general class of gradient-free ML algorithms - (

1,\lambda

)-Evolutionary Search - can be combined with classical distributed consensus algorithms to generate gradient-free byzantine-resilient distributed learning algorithms. We provide proofs and pseudo-code for two specific cases - the Total Order Broadcast and proof-of-work leader election.Comment: 10 pages, 4 listings, 2 theorem

arXiv.org e-Print Archive

RSA: Byzantine-Robust Stochastic Aggregation Methods for Distributed Learning from Heterogeneous Datasets

Author: Chen Tianyi
Giannakis Georgios B.
Li Liping
Ling Qing
Xu Wei
Publication venue
Publication date: 17/07/2019
Field of study

In this paper, we propose a class of robust stochastic subgradient methods for distributed learning from heterogeneous datasets at presence of an unknown number of Byzantine workers. The Byzantine workers, during the learning process, may send arbitrary incorrect messages to the master due to data corruptions, communication failures or malicious attacks, and consequently bias the learned model. The key to the proposed methods is a regularization term incorporated with the objective function so as to robustify the learning task and mitigate the negative effects of Byzantine attacks. The resultant subgradient-based algorithms are termed Byzantine-Robust Stochastic Aggregation methods, justifying our acronym RSA used henceforth. In contrast to most of the existing algorithms, RSA does not rely on the assumption that the data are independent and identically distributed (i.i.d.) on the workers, and hence fits for a wider class of applications. Theoretically, we show that: i) RSA converges to a near-optimal solution with the learning error dependent on the number of Byzantine workers; ii) the convergence rate of RSA under Byzantine attacks is the same as that of the stochastic gradient descent method, which is free of Byzantine attacks. Numerically, experiments on real dataset corroborate the competitive performance of RSA and a complexity reduction compared to the state-of-the-art alternatives.Comment: To appear in AAAI 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications