Efficient Communication/Computation Overlap with MPI+OpenMP Runtimes Collaboration

Marc Sergent, Mario Dagrada, Patrick Carribault, Julien Jaeger, Marc Pérache, Guillaume Papauré

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Overlap network communications and computations is a major requirement to ensure scalability of HPC applications on future exascale machines. To this purpose the de-facto MPI standard provides non-blocking routines for asynchronous communication progress. In various implementations, a dedicated progress thread (PT) is deployed on the host CPU to actually achieve this overlap. However, current PT solutions struggle to find a balance between efficient detection of network events and minimal impact on the application computations. In this paper we propose a solution inspired from the PT approach which benefits from idle time of compute threads to make MPI communication progress in background. We implement our idea in the context of MPI+OpenMP collaboration using the OpenMP Tools interface which will be part of the OpenMP 5.0 standard. Our solution shows an overall performance gain on unbalanced workloads such as the AMG CORAL benchmark.

Original languageEnglish
Title of host publicationEuro-Par 2018
Subtitle of host publicationParallel Processing - 24th International Conference on Parallel and Distributed Computing, Proceedings
EditorsMassimo Torquati, Marco Aldinucci, Luca Padovani
PublisherSpringer Verlag
Pages560-572
Number of pages13
ISBN (Print)9783319969824
DOIs
Publication statusPublished - 1 Jan 2018
Externally publishedYes
Event24th International European Conference on Parallel and Distributed Computing, Euro-Par 2018 - Turin, Italy
Duration: 27 Aug 201831 Aug 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11014 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference24th International European Conference on Parallel and Distributed Computing, Euro-Par 2018
Country/TerritoryItaly
CityTurin
Period27/08/1831/08/18

Keywords

  • Distributed computing
  • Parallel computing
  • Runtime collaboration
  • Runtime systems

Fingerprint

Dive into the research topics of 'Efficient Communication/Computation Overlap with MPI+OpenMP Runtimes Collaboration'. Together they form a unique fingerprint.

Cite this