Topology-oblivious optimization of MPI broadcast algorithms on extreme-scale platforms

Khalid Hasanov, Jean-Noël Quintin, Alexey Lastovetsky

Research output: Contribution to journalArticlepeer-review

6 Scopus citations


© 2015 Elsevier B.V. All rights reserved. Significant research has been conducted in collective communication operations, in particular in MPI broadcast, on distributed memory platforms. Most of the research efforts aim to optimize the collective operations for particular architectures by taking into account either their topology or platform parameters. In this work we propose a simple but general approach to optimization of the legacy MPI broadcast algorithms, which are widely used in MPICH and Open MPI. The proposed optimization technique is designed to address the challenge of extreme scale of future HPC platforms. It is based on hierarchical transformation of the traditionally flat logical arrangement of communicating processors. Theoretical analysis and experimental results on IBM BlueGene/P and a cluster of the Grid'5000 platform are presented.
Original languageEnglish (US)
Pages (from-to)30-39
Number of pages10
JournalSimulation Modelling Practice and Theory
StatePublished - Nov 2015
Externally publishedYes


Dive into the research topics of 'Topology-oblivious optimization of MPI broadcast algorithms on extreme-scale platforms'. Together they form a unique fingerprint.

Cite this