Optimizing tensor contractions in CCSD(T) for efficient execution on GPUs
- Authors
- Kim, J.; Sukumaran-Rajam, A.; Hong, C.; Panyala, A.; Srivastava, R.K.; Krishnamoorthy, S.; Sadayappan, P.
- Issue Date
- Jun-2018
- Publisher
- Association for Computing Machinery
- Keywords
- CCSD(T); Coupled Cluster Methods; GPU Computing; Loop Fusion; Tensor Contractions
- Citation
- Proceedings of the International Conference on Supercomputing, pp 96 - 106
- Pages
- 11
- Journal Title
- Proceedings of the International Conference on Supercomputing
- Start Page
- 96
- End Page
- 106
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/63871
- DOI
- 10.1145/3205289.3205296
- Abstract
- Tensor contractions are higher dimensional analogs of matrix multiplications, used in many computational contexts such as high order models in quantum chemistry, deep learning, finite element methods etc. In contrast to the wide availability of high-performance libraries for matrix multiplication on GPUs, the same is not true for tensor contractions. In this paper, we address the optimization of a set of symmetrized tensor contractions that form the computational bottleneck in the CCSD(T) coupled-cluster method in computational chemistry suites like NWChem. Some of the challenges in optimizing tensor contractions that arise in practice from the variety of dimensionalities and shapes for tensors include effective mapping of the high-dimensional iteration space to threads, choice of data buffering in shared-memory and registers, and tile sizes for multi-level tiling. Furthermore, in the case of symmetrized tensor contractions in CCSD(T), it is also a challenge to fuse contractions to reduce data movement cost by exploiting reuse of intermediate tensors. In this paper, we develop an efficient GPU implementation of the tensor contractions in CCSD(T) using shared-memory buffering, register tiling, loop fusion and register transpose. Experimental results demonstrate significant improvement over the current state-of-the-art. © 2018 Association for Computing Machinery.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Software > School of Computer Science and Engineering > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/63871)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.