TY - GEN
T1 - MPI Overlap
T2 - 45th International Conference on Parallel Processing, ICPP 2016
AU - Denis, Alexandre
AU - Trahay, Francois
N1 - Publisher Copyright: © 2016 IEEE.
PY - 2016/9/21
Y1 - 2016/9/21
N2 - In HPC applications, one of the major overheads compared to sequential code is communication cost. Application programmers often amortize this cost by overlapping communications with computation. To do so, they post a non-blocking MPI request, perform computation, and wait for communication completion, assuming MPI communication will progress in the background. In this paper, we propose to measure what really happens when trying to overlap non-blocking point-to-point communications with computation. We explain how background progression works, we describe relevant test cases, we identify challenges for a benchmark, then we propose a benchmark suite to measure how much overlap happens in various cases. We exhibit overlap benchmark results on a wide panel of MPI libraries and hardware platforms. Finally, we classify, analyze, and explain the results using low-level traces to reveal the internal behavior of the MPI library.
AB - In HPC applications, one of the major overheads compared to sequential code is communication cost. Application programmers often amortize this cost by overlapping communications with computation. To do so, they post a non-blocking MPI request, perform computation, and wait for communication completion, assuming MPI communication will progress in the background. In this paper, we propose to measure what really happens when trying to overlap non-blocking point-to-point communications with computation. We explain how background progression works, we describe relevant test cases, we identify challenges for a benchmark, then we propose a benchmark suite to measure how much overlap happens in various cases. We exhibit overlap benchmark results on a wide panel of MPI libraries and hardware platforms. Finally, we classify, analyze, and explain the results using low-level traces to reveal the internal behavior of the MPI library.
KW - Benchmark
KW - HPC
KW - MPI
KW - Overlap
U2 - 10.1109/ICPP.2016.37
DO - 10.1109/ICPP.2016.37
M3 - Conference contribution
AN - SCOPUS:84990935710
T3 - Proceedings of the International Conference on Parallel Processing
SP - 258
EP - 267
BT - Proceedings - 45th International Conference on Parallel Processing, ICPP 2016
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 16 August 2016 through 19 August 2016
ER -