GPU-based multi-group discrete ordinates transport calculations: Parallel computing implementation in STRAUMopen access
- Authors
- Zhang, Ao; Hong, Ser Gi; Jeong, Seungil; Chen, Jingen
- Issue Date
- Sep-2025
- Publisher
- 한국원자력학회
- Keywords
- Multi-GPUs; Parallel computing; Group chunk decomposition; Unstructured SN transport
- Citation
- Nuclear Engineering and Technology, v.57, no.9, pp 1 - 12
- Pages
- 12
- Indexed
- SCIE
SCOPUS
KCI
- Journal Title
- Nuclear Engineering and Technology
- Volume
- 57
- Number
- 9
- Start Page
- 1
- End Page
- 12
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210039
- DOI
- 10.1016/j.net.2025.103597
- ISSN
- 1738-5733
2234-358X
- Abstract
- Discrete ordinates (SN) method with unstructured meshes is highly appropriate for high-fidelity modeling and simulation of radiation shielding problems with complicated geometries. However, the large number of unknowns resulting from discretization of the transport equation on the spatial, angular, and energy variables, necessitates the use of parallel computing to achieve efficient solutions. In this work, a GPU-based SN transport sweep algorithm combined with a GPU-parallel multi-group Krylov subspace solver has been proposed and implemented in the STRAUM (SN Transport for Radiation Analysis with Unstructured Meshes) code. A group chunk decomposition method within the framework of the multi-group Krylov subspace solver has been applied to STRAUM to leverage multi-GPU parallel computing. For the Kobayashi-like and reactor pressure vessel problems, STRAUM typically runs faster by factors of 100∼200 on a single NVIDIA GeForce RTX 4090 GPU and by factors of 70∼120 on a single NVIDIA GeForce RTX 3080 Ti GPU than on a single AMD Ryzen 9 7900X CPU core. For the simulations on dual-GPU systems, the group chunk decomposition method achieves parallel computing efficiencies greater than 90% without degradation in convergence except for cases using very coarse angular divisions. Besides, this method reduces per-GPU memory usage by more than 40% and enables STRAUM to effectively simulate problems with up to ten billion unknowns using two RTX 4090 GPUs.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 원자력공학과 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.