TY - JOUR
T1 - A 1.8 trillion degrees-of-freedom, 1.24 petaflops global seismic wave simulation on the K computer
AU - Tsuboi, Seiji
AU - Ando, Kazuto
AU - Miyoshi, Takayuki
AU - Peter, Daniel
AU - Komatitsch, Dimitri
AU - Tromp, Jeroen
N1 - KAUST Repository Item: Exported on 2020-10-01
PY - 2016/7/27
Y1 - 2016/7/27
N2 - We present high-performance simulations of global seismic wave propagation with an unprecedented accuracy of 1.2 s seismic period for a realistic three-dimensional Earth model using the spectral element method on the K computer. Our seismic simulations use a total of 665.2 billion grid points and resolve 1.8 trillion degrees of freedom. To realize these large-scale computations, we optimize a widely used community software code to efficiently address all hardware parallelization, especially thread-level parallelization to solve the bottleneck of memory usage for coarse-grained parallelization. The new code exhibits excellent strong scaling for the time stepping loop, that is, parallel efficiency on 82,134 nodes relative to 36,504 nodes is 99.54%. Sustained performance of these computations on the K computer is 1.24 petaflops, which is 11.84% of its peak performance. The obtained seismograms with an accuracy of 1.2 s for the entire globe should help us to better understand rupture mechanisms of devastating earthquakes.
AB - We present high-performance simulations of global seismic wave propagation with an unprecedented accuracy of 1.2 s seismic period for a realistic three-dimensional Earth model using the spectral element method on the K computer. Our seismic simulations use a total of 665.2 billion grid points and resolve 1.8 trillion degrees of freedom. To realize these large-scale computations, we optimize a widely used community software code to efficiently address all hardware parallelization, especially thread-level parallelization to solve the bottleneck of memory usage for coarse-grained parallelization. The new code exhibits excellent strong scaling for the time stepping loop, that is, parallel efficiency on 82,134 nodes relative to 36,504 nodes is 99.54%. Sustained performance of these computations on the K computer is 1.24 petaflops, which is 11.84% of its peak performance. The obtained seismograms with an accuracy of 1.2 s for the entire globe should help us to better understand rupture mechanisms of devastating earthquakes.
UR - http://hdl.handle.net/10754/622498
UR - http://journals.sagepub.com/doi/10.1177/1094342016632596
UR - http://www.scopus.com/inward/record.url?scp=84995378480&partnerID=8YFLogxK
U2 - 10.1177/1094342016632596
DO - 10.1177/1094342016632596
M3 - Article
SN - 1094-3420
VL - 30
SP - 411
EP - 422
JO - International Journal of High Performance Computing Applications
JF - International Journal of High Performance Computing Applications
IS - 4
ER -