TY - JOUR
T1 - Performance characteristics of hybrid MPI/OpenMP implementations of NAS parallel benchmarks SP and BT on large-scale multicore supercomputers
AU - Wu, Xingfu
AU - Taylor, Valerie
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledged KAUST grant number(s): KUS-I1-010-01
Acknowledgements: This work is supported by NSF grant CNS-0911023, and theAward No. KUS-I1-010-01 made by King Abdullah University ofScience and Technology (KAUST). The authors would like toacknowledge Argonne Leadership Computing Facility at ArgonneNational Laboratory for the use of BlueGene/P and National Centerfor Computational Science at Oak Ridge National Laboratory forthe use of Jaguar and JaguarPF under DOE INCITE project“Performance Evaluation and Analysis Consortium End Station”,and Haoqiang Jin from NASA Ames Research Center for providinghis BT code.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.
PY - 2011/3/29
Y1 - 2011/3/29
N2 - The NAS Parallel Benchmarks (NPB) are well-known applications with the fixed algorithms for evaluating parallel systems and tools. Multicore supercomputers provide a natural programming paradigm for hybrid programs, whereby OpenMP can be used with the data sharing with the multicores that comprise a node and MPI can be used with the communication between nodes. In this paper, we use SP and BT benchmarks of MPI NPB 3.3 as a basis for a comparative approach to implement hybrid MPI/OpenMP versions of SP and BT. In particular, we can compare the performance of the hybrid SP and BT with the MPI counterparts on large-scale multicore supercomputers. Our performance results indicate that the hybrid SP outperforms the MPI SP by up to 20.76%, and the hybrid BT outperforms the MPI BT by up to 8.58% on up to 10,000 cores on BlueGene/P at Argonne National Laboratory and Jaguar (Cray XT4/5) at Oak Ridge National Laboratory. We also use performance tools and MPI trace libraries available on these supercomputers to further investigate the performance characteristics of the hybrid SP and BT.
AB - The NAS Parallel Benchmarks (NPB) are well-known applications with the fixed algorithms for evaluating parallel systems and tools. Multicore supercomputers provide a natural programming paradigm for hybrid programs, whereby OpenMP can be used with the data sharing with the multicores that comprise a node and MPI can be used with the communication between nodes. In this paper, we use SP and BT benchmarks of MPI NPB 3.3 as a basis for a comparative approach to implement hybrid MPI/OpenMP versions of SP and BT. In particular, we can compare the performance of the hybrid SP and BT with the MPI counterparts on large-scale multicore supercomputers. Our performance results indicate that the hybrid SP outperforms the MPI SP by up to 20.76%, and the hybrid BT outperforms the MPI BT by up to 8.58% on up to 10,000 cores on BlueGene/P at Argonne National Laboratory and Jaguar (Cray XT4/5) at Oak Ridge National Laboratory. We also use performance tools and MPI trace libraries available on these supercomputers to further investigate the performance characteristics of the hybrid SP and BT.
UR - http://hdl.handle.net/10754/599158
UR - https://dl.acm.org/doi/10.1145/1964218.1964228
U2 - 10.1145/1964218.1964228
DO - 10.1145/1964218.1964228
M3 - Article
SN - 0163-5999
VL - 38
SP - 56
JO - ACM SIGMETRICS Performance Evaluation Review
JF - ACM SIGMETRICS Performance Evaluation Review
IS - 4
ER -