Skip to main navigation Skip to search Skip to main content

Overhead of a decentralized gossip algorithm on the performance of HPC applications

  • Ely Levy
  • , Amnon Barak
  • , Amnon Shiloh
  • , Matthias Lieber
  • , Carsten Weinhold
  • , Hermann Härtig

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Gossip algorithms can provide online information about the availability and the state of the resources in supercomputers. These algorithms require minimal computing and storage capabilities at each node and when properly tuned, they are not expected to overload the nodes or the network that connects these nodes. These properties make gossip interesting for future exascale systems. This paper examines the overhead of a decentralized gossip algorithm on the performance of parallel MPI applications running on up to 8192 nodes of an IBM BlueGene/Q supercomputer. The applications that were used in the experiments include PTRANS and MPI-FFT from the HPCC benchmark suite as well as the coupled weather and cloud simulation model COSMOSPECS+ FD4. In most cases, no gossip overhead was observed when the gossip messages were sent at intervals of 256ms or more. As expected, the overhead that is observed at higher rates is sensitive to the communication pattern of the application and the amount of gossip information being circulated.

Original languageEnglish
Title of host publicationProceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2014 - In Conjunction with ICS 2014
PublisherAssociation for Computing Machinery
ISBN (Print)9781450329507
DOIs
StatePublished - 2014
Event4th International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2014 - In Conjunction with ICS 2014 - Munich, Germany
Duration: 10 Jun 201410 Jun 2014

Publication series

NameProceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2014 - In Conjunction with ICS 2014

Conference

Conference4th International Workshop on Runtime and Operating Systems for Supercomputers, ROSS 2014 - In Conjunction with ICS 2014
Country/TerritoryGermany
CityMunich
Period10/06/1410/06/14

Keywords

  • Benchmarking
  • Cluster management
  • Gossip algorithm
  • High performance computing

Fingerprint

Dive into the research topics of 'Overhead of a decentralized gossip algorithm on the performance of HPC applications'. Together they form a unique fingerprint.

Cite this