Fast and Robust Distributed Learning in High Dimension

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Could a gradient aggregation rule (GAR) for distributed machine learning be both robust and fast? This paper answers by the affirmative through MULTI-BULYAN. Given n workers, f of which are arbitrary malicious (Byzantine) and m=n-f are not, we prove that MULTI-BULYAN can ensure a strong form of Byzantine resilience, as well as an \fracmn slowdown, compared to averaging, the fastest (but non Byzantine resilient) rule for distributed machine learning. When m\approx n (almost all workers are correct), MULTI-BULYAN reaches the speed of averaging. We also prove that MULTI-BULYAN's cost in local computation is O(d) (like averaging), an important feature for ML where d commonly reaches 109, while robust alternatives have at least quadratic cost in d. Our theoretical findings are complemented with an experimental evaluation which, in addition to supporting the linear O(d) complexity argument, conveys the fact that MULTI-BULYAN's parallelisability further adds to its efficiency.

Original languageEnglish
Title of host publicationProceedings - 2020 International Symposium on Reliable Distributed Systems, SRDS 2020
PublisherIEEE Computer Society
Pages71-80
Number of pages10
ISBN (Electronic)9781728176260
DOIs
Publication statusPublished - 1 Sept 2020
Externally publishedYes
Event39th International Symposium on Reliable Distributed Systems, SRDS 2020 - Virtual, Shanghai, China
Duration: 21 Sept 202024 Sept 2020

Publication series

NameProceedings of the IEEE Symposium on Reliable Distributed Systems
Volume2020-September
ISSN (Print)1060-9857

Conference

Conference39th International Symposium on Reliable Distributed Systems, SRDS 2020
Country/TerritoryChina
CityVirtual, Shanghai
Period21/09/2024/09/20

Keywords

  • Byzantine Resilience
  • Distributed Systems
  • High Dimension
  • Machine Learning
  • Non-Convex Optimization
  • Stochastic Gradient Descent

Fingerprint

Dive into the research topics of 'Fast and Robust Distributed Learning in High Dimension'. Together they form a unique fingerprint.

Cite this