An Overview on Mixing MPI and OpenMP Dependent Tasking on A64FX

Romain Pereira, Adrien Roussel, Miwako Tsuji, Patrick Carribault, Mitsuhisa Sato, Hitoshi Murai, Thierry Gautier

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The adoption of ARM processor architectures is on the rise in the HPC ecosystem. Fugaku supercomputer is a homogeneous ARM-based machine, and is one among the most powerful machine in the world. In the programming world, dependent task-based programming models are gaining tractions due to their many advantages: dynamic load balancing, implicit expression of communication/computation overlap, early-bird communication posting, . . . MPI and OpenMP are two widespreads programming standards that make possible task-based programming at a distributed memory level. Despite its many advantages, mixed-use of the standard programming models using dependent tasks is still under-evaluated on large-scale machines. In this paper, we provide an overview on mixing OpenMP dependent tasking model with MPI with the state-of-the-art software stack (GCC-13, Clang17, MPC-OMP). We provide the level of performances to expect by porting applications to such mixed-use of the standard on the Fugaku supercomputers, using two benchmarks (Cholesky, HPCCG) and a proxy-application (LULESH). We show that software stack, resource binding and communication progression mechanisms are factors that have a significant impact on performance. On distributed applications, performances reaches up to 80% of effiency for task-based applications like HPCCG. We also point-out a few areas of improvements in OpenMP runtimes.

Original languageEnglish
Title of host publicationProceedings of International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPC Asia 2024 Workshops
PublisherAssociation for Computing Machinery
Pages7-16
Number of pages10
ISBN (Electronic)9798400716522
DOIs
Publication statusPublished - 11 Jan 2024
Externally publishedYes
Event2024 International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPC Asia 2024 Workshops - Nagoya, Japan
Duration: 25 Jan 2024 → …

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2024 International Conference on High Performance Computing in Asia-Pacific Region Workshops, HPC Asia 2024 Workshops
Country/TerritoryJapan
CityNagoya
Period25/01/24 → …

Keywords

  • Dependency
  • Graph
  • HPC
  • MPI
  • OpenMP
  • Task

Fingerprint

Dive into the research topics of 'An Overview on Mixing MPI and OpenMP Dependent Tasking on A64FX'. Together they form a unique fingerprint.

Cite this