TY - JOUR
T1 - Accelerating VASP electronic structure calculations using graphic processing units
AU - Hacene, Mohamed
AU - Anciaux-Sedrakian, Ani
AU - Rozanska, Xavier
AU - Klahr, Diego
AU - Guignon, Thomas
AU - Fleurat-Lessard, Paul
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledged KAUST grant number(s): UK-C0017
Acknowledgements: Contract/grant sponsor: King Abdullah University of Science and Technology (KAUST, Award No. UK-C0017).
This publication acknowledges KAUST support, but has no KAUST affiliated authors.
PY - 2012/8/20
Y1 - 2012/8/20
N2 - We present a way to improve the performance of the electronic structure Vienna Ab initio Simulation Package (VASP) program. We show that high-performance computers equipped with graphics processing units (GPUs) as accelerators may reduce drastically the computation time when offloading these sections to the graphic chips. The procedure consists of (i) profiling the performance of the code to isolate the time-consuming parts, (ii) rewriting these so that the algorithms become better-suited for the chosen graphic accelerator, and (iii) optimizing memory traffic between the host computer and the GPU accelerator. We chose to accelerate VASP with NVIDIA GPU using CUDA. We compare the GPU and original versions of VASP by evaluating the Davidson and RMM-DIIS algorithms on chemical systems of up to 1100 atoms. In these tests, the total time is reduced by a factor between 3 and 8 when running on n (CPU core + GPU) compared to n CPU cores only, without any accuracy loss. © 2012 Wiley Periodicals, Inc.
AB - We present a way to improve the performance of the electronic structure Vienna Ab initio Simulation Package (VASP) program. We show that high-performance computers equipped with graphics processing units (GPUs) as accelerators may reduce drastically the computation time when offloading these sections to the graphic chips. The procedure consists of (i) profiling the performance of the code to isolate the time-consuming parts, (ii) rewriting these so that the algorithms become better-suited for the chosen graphic accelerator, and (iii) optimizing memory traffic between the host computer and the GPU accelerator. We chose to accelerate VASP with NVIDIA GPU using CUDA. We compare the GPU and original versions of VASP by evaluating the Davidson and RMM-DIIS algorithms on chemical systems of up to 1100 atoms. In these tests, the total time is reduced by a factor between 3 and 8 when running on n (CPU core + GPU) compared to n CPU cores only, without any accuracy loss. © 2012 Wiley Periodicals, Inc.
UR - http://hdl.handle.net/10754/597443
UR - http://doi.wiley.com/10.1002/jcc.23096
UR - http://www.scopus.com/inward/record.url?scp=84869206889&partnerID=8YFLogxK
U2 - 10.1002/jcc.23096
DO - 10.1002/jcc.23096
M3 - Article
C2 - 22903247
SN - 0192-8651
VL - 33
SP - 2581
EP - 2589
JO - Journal of Computational Chemistry
JF - Journal of Computational Chemistry
IS - 32
ER -