TY - JOUR
T1 - Simulating MPI Applications: The SMPI Approach
AU - Degomme, Augustin
AU - Legrand, Arnaud
AU - Markomanolis, Georgios
AU - Quinson, Martin
AU - Stillwell, Mark
AU - Suter, Frédéric
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: The authors would like to thank the SimGrid team members and collaborators who contributed to SMPI: P. Bedaride, H. Casanova, P.-N. Clauss, F. Desprez, S. Genaud, A. Gupta, and B. Videau. This work is partially supported by the Hac Specis Inria Project Lab, the ANR SONGS (11- ANR-INFR-13), CNRS PICS N degree 5473, and European Mont-Blanc (EC grant 288777) projects. Experiments were carried out on a PRACE (EC grants RI-261557 and RI283493) prototype and the Grid’5000 experimental testbed, supported by a scientific interest group hosted by Inria and including CNRS, RENATER, and other Universities and organizations.
PY - 2017/8/1
Y1 - 2017/8/1
N2 - This article summarizes our recent work and developments on SMPI, a flexible simulator of MPI applications. In this tool, we took a particular care to ensure our simulator could be used to produce fast and accurate predictions in a wide variety of situations. Although we did build SMPI on SimGrid whose speed and accuracy had already been assessed in other contexts, moving such techniques to a HPC workload required significant additional effort. Obviously, an accurate modeling of communications and network topology was one of the key to such achievements. Another less obvious key was the choice to combine in a single tool the possibility to do both offline and online simulation.
AB - This article summarizes our recent work and developments on SMPI, a flexible simulator of MPI applications. In this tool, we took a particular care to ensure our simulator could be used to produce fast and accurate predictions in a wide variety of situations. Although we did build SMPI on SimGrid whose speed and accuracy had already been assessed in other contexts, moving such techniques to a HPC workload required significant additional effort. Obviously, an accurate modeling of communications and network topology was one of the key to such achievements. Another less obvious key was the choice to combine in a single tool the possibility to do both offline and online simulation.
UR - http://hdl.handle.net/10754/655928
UR - https://ieeexplore.ieee.org/document/7855780/
UR - http://www.scopus.com/inward/record.url?scp=85029049825&partnerID=8YFLogxK
U2 - 10.1109/TPDS.2017.2669305
DO - 10.1109/TPDS.2017.2669305
M3 - Article
SN - 1045-9219
VL - 28
SP - 2387
EP - 2400
JO - IEEE Transactions on Parallel and Distributed Systems
JF - IEEE Transactions on Parallel and Distributed Systems
IS - 8
ER -