Abstract
A scalable parallelization algorithm to port an explicit marching-on-in-time (MOT)-based time domain volume integral equation (TDVIE) solver onto multi-GPUs is described. The algorithm makes use of MPI and OpenACC for efficient implementation. The MPI processes are responsible for synchronizing and communicating the distributed compute kernels of the MOT-TDVIE solver between the GPUs, where one MPI task is assigned to one GPU. The compiler directives of the OpenACC are responsible for the data transfer and kernels’ offloading from the CPU to the GPU and their execution on the GPU. The speedups achieved against the MPI/OpenMP code execution on multiple CPUs and parallel efficiencies are presented.
Original language | English (US) |
---|---|
Pages (from-to) | 164-167 |
Number of pages | 4 |
Journal | Applied Computational Electromagnetics Society Journal |
Volume | 33 |
Issue number | 2 |
State | Published - Feb 2018 |
Keywords
- Explicit marching-on-in-time scheme
- GPU
- MPI
- OpenACC
- Time-domain volume integral equation
ASJC Scopus subject areas
- Astronomy and Astrophysics
- Electrical and Electronic Engineering