Skip to main navigation Skip to search Skip to main content

Corrected Gossip Algorithms for Fast Reliable Broadcast on Unreliable Systems

  • Torsten Hoefler
  • , Amnon Barak
  • , Amnon Shiloh
  • , Zvi Drezner

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

Large-scale parallel programming environments and algorithms require efficient group-communication on computing systems with failing nodes. Existing reliable broadcast algorithms either cannot guarantee that all nodes are reached or are very expensive in terms of the number of messages and latency. This paper proposes Corrected-Gossip, a method that combines Monte Carlo style gossiping with a deterministic correction phase, to construct a Las Vegas style reliable broadcast that guarantees reaching all the nodes at low cost. We analyze the performance of this method both analytically and by simulations and show how it reduces the latency and network load compared to existing algorithms. Our method improves the latency by 20% and the network load by 53% compared to the fastest known algorithm on 4,096 nodes. We believe that the principle of corrected-gossip opens an avenue for many other reliable group communication operations.

Original languageEnglish
Title of host publicationProceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium, IPDPS 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages357-366
Number of pages10
ISBN (Electronic)9781538639146
DOIs
StatePublished - 30 Jun 2017
Event31st IEEE International Parallel and Distributed Processing Symposium, IPDPS 2017 - Orlando, United States
Duration: 29 May 20172 Jun 2017

Publication series

NameProceedings - 2017 IEEE 31st International Parallel and Distributed Processing Symposium, IPDPS 2017

Conference

Conference31st IEEE International Parallel and Distributed Processing Symposium, IPDPS 2017
Country/TerritoryUnited States
CityOrlando
Period29/05/172/06/17

Bibliographical note

Publisher Copyright:
© 2017 IEEE.

Keywords

  • Gossip algorithms
  • reliable broadcast

Fingerprint

Dive into the research topics of 'Corrected Gossip Algorithms for Fast Reliable Broadcast on Unreliable Systems'. Together they form a unique fingerprint.

Cite this