A scalable and generic task scheduling system for communication libraries

Francois Trahay, Alexandre Denis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Since the advent of multi-core processors, the physiognomy of typical clusters has dramatically evolved. This new massively multi-core era is a major change in architecture, causing the evolution of programming models towards hybrid MPI+threads, therefore requiring new features at low-level. Modern communication subsystems now have to deal with multithreading: the impact of thread-safety, the contention on network interfaces or the consequence of data locality on performance have to be studied carefully. In this paper, we present PIOMan, a scalable and generic lightweight task scheduling system for communication libraries. It is designed to ensure concurrent progression of multiple tasks of a communication library (polling, offload, multi-rail) through the use of multiple cores, while preserving locality to avoid contention and allow a scalability to a large number of cores and threads. We have implemented the model, evaluated its performance, and compared it to state of the art solutions regarding overhead, scalability, and communication and computation overlap.

Original languageEnglish
Title of host publication2009 IEEE International Conference on Cluster Computing and Workshops, CLUSTER '09
DOIs
Publication statusPublished - 21 Dec 2009
Externally publishedYes
Event2009 IEEE International Conference on Cluster Computing and Workshops, CLUSTER '09 - New Orleans, LA, United States
Duration: 31 Aug 20094 Sept 2009

Publication series

NameProceedings - IEEE International Conference on Cluster Computing, ICCC
ISSN (Print)1552-5244

Conference

Conference2009 IEEE International Conference on Cluster Computing and Workshops, CLUSTER '09
Country/TerritoryUnited States
CityNew Orleans, LA
Period31/08/094/09/09

Fingerprint

Dive into the research topics of 'A scalable and generic task scheduling system for communication libraries'. Together they form a unique fingerprint.

Cite this