Evaluating Data Redistribution in PaRSEC

Qinglei Cao, George Bosilca, Nuria Losada, Wei Wu, Dong Zhong, Jack Dongarra

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

Data redistribution aims to reshuffle data to optimize some objective for an algorithm. The objective can be multi-dimensional, such as improving computational load balance or decreasing communication volume or cost, with the ultimate goal of increasing the efficiency and therefore reducing the time-to-solution for the algorithm. The classic redistribution problem focuses on optimally scheduling communications when reshuffling data between two regular, usually block-cyclic, data distributions. Besides distribution, data size is also a performance-critical parameter because it affects the reshuffling algorithm in terms of cache, communication efficiency, and potential parallelism. In addition, task-based runtime systems have gained popularity recently as a potential candidate to address the programming complexity on the way to exascale. In this scenario, it becomes paramount to develop a flexible redistribution algorithm for task-based runtime systems, which could support all types of regular and irregular data distributions and take data size into account. In this article, we detail a flexible redistribution algorithm and implement an efficient approach in a task-based runtime system, PaRSEC. Performance results show great capability compared to the theoretical bound and ScaLAPACK, and applications highlight an increased efficiency with little overhead in terms of data distribution, data size, and data format.
Original languageEnglish (US)
Pages (from-to)1856-1872
Number of pages17
JournalIEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
Volume33
Issue number8
DOIs
StatePublished - Nov 30 2021
Externally publishedYes

Fingerprint

Dive into the research topics of 'Evaluating Data Redistribution in PaRSEC'. Together they form a unique fingerprint.

Cite this