Kids Library Home

Welcome to the Kids' Library!

Search for books, movies, music, magazines, and more.

     
Available items only
Print Material
Author Thomas, Stephen (Mathematician), author.

Title Two-stage Gauss-Seidel preconditioners and smoothers for Krylov solvers on a GPU cluster: preprint / Stephen Thomas and [seven others].

Publication Info. Golden, CO : National Renewable Energy Laboratory, 2022.

Copies

Description 1 online resource (23 pages) : color illustrations.
text txt rdacontent
computer c rdamedia
online resource cr rdacarrier
Series NREL/CP ; 2C00-80263
Conference paper (National Renewable Energy Laboratory (U.S.)) ; 2C00-80263.
Note "February 2022."
"Presented at the SIAM Annual Meeting, July 19-23, 2021"--Cover.
Bibliography Includes bibliographical references (pages 21-23).
Funding DE-AC36-08GO28308
Note Description based on online resource; title from PDF title page (NREL, viewed June 8, 2022).
Summary Gauss-Seidel (GS) relaxation is often employed as a preconditioner for a Krylov solver or as a smoother for Algebraic Multigrid (AMG). However, the requisite sparse triangular solve is difficult to parallelize on many-core architectures such as graphics processing units (GPUs). In the present study, the performance of the sequential GS relaxation based on a triangular solve is compared with two-stage variants, replacing the direct triangular solve with a fixed number of inner Jacobi-Richardson (JR) iterations. When a small number of inner iterations is sufficient to maintain the Krylov convergence rate, the two-stage GS (GS2) often outperforms the sequential algorithm on many-core architectures. The GS2 algorithm is also compared with JR. When they perform the same number of ops for SpMV (e.g. three JR sweeps compared to two GS sweeps with one inner JR sweep), the GS2 iterations, and the Krylov solver preconditioned with GS2, may converge faster than the JR iterations. Moreover, for some problems (e.g. elasticity), it was found that JR may diverge with a damping factor of one, whereas two-stage GS may improve the convergence with more inner iterations. Finally, to study the performance of the two-stage smoother and preconditioner for a practical problem, these were applied to incompressible uid ow simulations on GPUs.
Subject Graphics processing units -- Testing.
Parallel programming (Computer science)
Parallel processing (Electronic computers)
Programmation parallèle (Informatique)
Parallélisme (Informatique)
Parallel processing (Electronic computers)
Parallel programming (Computer science)
Indexed Term algebraic multigrid
AMG
Gauss-Seidel
krylov
preconditioners
smoothers
Added Author National Renewable Energy Laboratory (U.S.), issuing body.
Standard No. 1845268 OSTI ID
Gpo Item No. 0430-P-04 (online)
Sudoc No. E 9.17:NREL/CP-2 C 00-80263

 
    
Available items only