Speaker
Yao-Yuan Mao
(National Taiwan University)
Description
We present a GPU-based conjugate gradient (CG) solver for lattice QCD with domain-wall fermion (DWF). It is well known that CG is the most time-consuming part in the hybrid Monte Carlo simulation of unquenched lattice QCD, which becomes even more computational demanding for lattice QCD with exact chiral symmetry. We have implemented the CG for a general 5-dimensional DWF operator on NVIDIA's CUDA architecture with mixed-precision, using the defect correction as well as the reliable updates algorithms. For NVIDIA's GTX 285, our CG solver for the optimal domain-wall fermion achieves 180 Gflops (sustained). Since the performances of the defect correction and the reliable updates depend on the source vector, which also change at different stages of the CG, we investigate a dynamically adaptive scheme which can pick the best one among these two algorithms such that the entire CG can attain the optimal performance.
Please, insert your presentation type (talk, poster) | talk |
---|
Primary authors
Dr
Kenji Ogawa
(National Taiwan University)
Prof.
Ting-Wai Chiu
(National Taiwan University)
Yao-Yuan Mao
(National Taiwan University)