14–19 Jun 2010
Villasimius, Sardinia
Europe/Rome timezone

Domain decomposition method on GPU cluster for Lattice

18 Jun 2010, 17:40
20m
Room3 (Villasimius, Sardinia)

Room3

Villasimius, Sardinia

Tanka Village
Algorithms and machines Parallel 54: Algorithms and machines

Speaker

Yusuke Osaki (Hiroshima University, depertment of physical science)

Description

The modern graphic processing unit (GPU) has become one of the powerful computational device. Applying GPU to the lattice QCD computation has been investigated especially for the inversion solver of the discretized Dirac operators. Most of the previous works on lattice QCD with GPU, however, have been done with single GPU card. Since the single GPU performance and the memory size are limited at a few TFlops and of a few GBytes, the lattice sizes are limited to rather small sizes. Thus, to simulate QCD on more realistic lattice we need a cluster system with multiple GPU’s. The GPU cluster system requires a special care on the GPU- GPU-intercommunication because the modern GPU lacks the direct GPU-GPU communication feature. In order to make a good performance with a cluster with GPU, we investigated a domain decomposition preconditioner for the O(a)-improved Wilson-Dirac fermion inverter, where the lattice is decomposed into domains. In this talk we report some performance results on the additive Schwarz preconditioner on GPU cluster computing.
Please, insert your presentation type (talk, poster) talk

Primary author

Yusuke Osaki (Hiroshima University, depertment of physical science)

Co-author

Dr Ken-Ichi Ishikawa (Hiroshima University, depertment of physical science)

Presentation materials