DRAG: Divergence-based Adaptive Aggregation in Federated learning on
  Non-IID Data

Liu, Shengyun; Wang, Xin; Zhang, Jingjing; Zhu, Feng

DRAG: Divergence-based Adaptive Aggregation in Federated learning on Non-IID Data

Authors: Shengyun Liu
Xin Wang
Jingjing Zhang
Feng Zhu
Publication date: 4 September 2023
Publisher

Abstract

Local stochastic gradient descent (SGD) is a fundamental approach in achieving communication efficiency in Federated Learning (FL) by allowing individual workers to perform local updates. However, the presence of heterogeneous data distributions across working nodes causes each worker to update its local model towards a local optimum, leading to the phenomenon known as ``client-drift" and resulting in slowed convergence. To address this issue, previous works have explored methods that either introduce communication overhead or suffer from unsteady performance. In this work, we introduce a novel metric called ``degree of divergence," quantifying the angle between the local gradient and the global reference direction. Leveraging this metric, we propose the divergence-based adaptive aggregation (DRAG) algorithm, which dynamically ``drags" the received local updates toward the reference direction in each round without requiring extra communication overhead. Furthermore, we establish a rigorous convergence analysis for DRAG, proving its ability to achieve a sublinear convergence rate. Compelling experimental results are presented to illustrate DRAG's superior performance compared to state-of-the-art algorithms in effectively managing the client-drift phenomenon. Additionally, DRAG exhibits remarkable resilience against certain Byzantine attacks. By securely sharing a small sample of the client's data with the FL server, DRAG effectively counters these attacks, as demonstrated through comprehensive experiments

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2309.01779

Last time updated on 12/09/2023