Evaluation of OpenMP task scheduling algorithms for large NUMA architectures

Jérôme Clet-Ortega, Patrick Carribault, Marc Pérache

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Current generation of high performance computing platforms tends to hold a large number of cores. Therefore applications have to expose a fine-grain parallelism to be more efficient. Since version 3.0, the OpenMP standard proposes a way to express such parallelism through tasks. Because the task scheduling strategy is implementation defined, each runtime can have a different behavior and efficiency. Notwithstanding, the hierarchical characteristic of current parallel computing systems is rarely considered. This might come down to a loss of performance on large multicore NUMA systems. This paper studies multiple task scheduling algorithms with a configurable scheduler. It relies on a topology-aware tree-based representation of the computing platform to orchestrate the execution and the load-balacing of OpenMP tasks. High-end users can select the task-list granularity according to the tree structure and choose the most convenient work-stealing strategy. One of these strategies takes into account data locality with the help of the hierarchical view. It performs well with unbalanced codes, from BOTS benchmarks, in comparison to Intel and GNU OpenMP runtimes on 16-core and 128-core systems.

Original languageEnglish
Title of host publicationEuro-Par 2014
Subtitle of host publicationParallel Processing - 20th International Conference, Proceedings
EditorsFernando Silva, Inês Dutra, Vítor Santos Costa
PublisherSpringer Verlag
Pages596-607
Number of pages12
ISBN (Electronic)9783319098722
ISBN (Print)9783319098722
DOIs
Publication statusPublished - 1 Jan 2014
Externally publishedYes
Event20th International Conference on Parallel Processing, Euro-Par 2014 - Porto, Portugal
Duration: 25 Aug 201429 Aug 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8632 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th International Conference on Parallel Processing, Euro-Par 2014
Country/TerritoryPortugal
CityPorto
Period25/08/1429/08/14

Fingerprint

Dive into the research topics of 'Evaluation of OpenMP task scheduling algorithms for large NUMA architectures'. Together they form a unique fingerprint.

Cite this