Resource-Management Study in HPC Runtime-Stacking Context

Arthur Loussert, Benoît Welterlen, Patrick Carribault, Julien Jaeger, Marc Pérache, Raymond Namyst

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

With the advent of multicore and manycore processors as building blocks of HPC supercomputers, many applications shift from relying solely on a distributed programming model (e.g., MPI) to mixing distributed and shared-memory models (e.g., MPI+OpenMP), to better exploit shared-memory communications and reduce the overall memory footprint. One side effect of this programming approach is runtime stacking: mixing multiple models involve various runtime libraries to be alive at the same time and to share the underlying computing resources. This paper explores different configurations where this stacking may appear and introduces algorithms to detect the misuse of compute resources when running a hybrid parallel application. We have implemented our algorithms inside a dynamic tool that monitors applications and outputs resource usage to the user. We validated this tool on applications from CORAL benchmarks. This leads to relevant information which can be used to improve runtime placement, and to an average overhead lower than 1% of total execution time.

Original languageEnglish
Title of host publicationProceedings - 29th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages177-184
Number of pages8
ISBN (Electronic)9781509012336
DOIs
Publication statusPublished - 8 Nov 2017
Externally publishedYes
Event29th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2017 - Campinas, Brazil
Duration: 17 Oct 201720 Oct 2017

Publication series

NameProceedings - 29th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2017

Conference

Conference29th International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2017
Country/TerritoryBrazil
CityCampinas
Period17/10/1720/10/17

Keywords

  • HPC
  • MPI
  • OpenMP
  • Parallel Programming

Fingerprint

Dive into the research topics of 'Resource-Management Study in HPC Runtime-Stacking Context'. Together they form a unique fingerprint.

Cite this