Passer à la navigation principale Passer à la recherche Passer au contenu principal

A sampling-based approach for communication libraries auto-tuning

  • CNRS SAMOVAR UMR 5157
  • INRIA Institut National de Recherche en Informatique et en Automatique
  • Univ. Bordeaux

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

Communication performance is a critical issue in HPC applications, and many solutions have been proposed on the literature (algorithmic, protocols, etc.) In the meantime, computing nodes become massively multicore, leading to a real imbalance between the number of communication sources and the number of physical communication resources. Thus it is now mandatory to share network boards between computation flows, and to take this sharing into account while performing communication optimizations. In previous papers, we have proposed a model and a framework for on-the-fly optimizations of multiplexed concurrent communication flows, and implemented this model in the nm communication library. This library features optimization strategies able for example to aggregate several messages to reduce the number of packets emitted on the network, or to split messages to use several NICs at the same time. In this paper, we study the tuning of these dynamic optimization strategies. We show that some parameters and thresholds (rdv threshold, aggregation packet size) depend on the actual hardware, both host and NICs. We propose and implement a method based on sampling of the actual hardware to auto-tune our strategies. Moreover, we show that multi-rail can greatly benefit from performance predictions. We propose an approach for multi-rail that dynamically balance the data between NICs using predictions based on sampling.

langue originaleAnglais
titreProceedings - 2011 IEEE International Conference on Cluster Computing, CLUSTER 2011
Pages299-307
Nombre de pages9
Les DOIs
étatPublié - 16 nov. 2011
Modification externeOui
Evénement2011 IEEE International Conference on Cluster Computing, CLUSTER 2011 - Austin, TX, États-Unis
Durée: 26 sept. 201130 sept. 2011

Série de publications

NomProceedings - IEEE International Conference on Cluster Computing, ICCC
ISSN (imprimé)1552-5244

Une conférence

Une conférence2011 IEEE International Conference on Cluster Computing, CLUSTER 2011
Pays/TerritoireÉtats-Unis
La villeAustin, TX
période26/09/1130/09/11

Empreinte digitale

Examiner les sujets de recherche de « A sampling-based approach for communication libraries auto-tuning ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation