Measuring and interpreting performances of HPC applications with dependent tasks

Romain Pereira, Thierry Gautier, Adrien Roussel, Patrick Carribault

Research output: Contribution to journalArticlepeer-review

Abstract

Breaking down the parallel time into work, idleness, and overheads is crucial for assessing the performance of HPC applications but is challenging to measure in runtime systems with dependent tasks. No existing tools allow its measurement accurately. This paper introduces POT: a tool-suite for parallel applications performance analysis with support for dependent tasks. We focus on its low-disturbance methodology consisting of parallel object modeling, discrete-event tracing, and post-mortem simulation-based analysis. The POT tool-suite allows the tracing and analysis of OMPT (OpenMP), PMPI (MPI) and pthreads events. The paper evaluates the accuracy of POT's analysis on LLVM and MPC-OMP implementations. It shows that measurement bias may be neglected above 16μs workload per task, portably across two architectures and OpenMP runtime systems. We also illustrate the benefits unveiled by POT post-mortem simulation approach for analyzing mixed programming models with MPI+OpenMP.

Original languageEnglish
Article number107933
JournalFuture Generation Computer Systems
Volume174
DOIs
Publication statusPublished - 1 Jan 2026
Externally publishedYes

Keywords

  • High performance computing
  • MPI
  • OpenMP
  • Performance tool
  • Tasks

Fingerprint

Dive into the research topics of 'Measuring and interpreting performances of HPC applications with dependent tasks'. Together they form a unique fingerprint.

Cite this