Extending DD-$Ξ±$AMG on heterogeneous machines
July 10, 2024 Β· Declared Dead Β· + Add venue
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Gustavo Ramirez-Hidalgo, Lianhua He, Ke-Long Zhang
arXiv ID
2407.08092
Category
hep-lat
Cross-listed
cs.DC,
math.NA
Citations
0
Last Checked
3 months ago
Abstract
Multigrid solvers are the standard in modern scientific computing simulations. Domain Decomposition Aggregation-Based Algebraic Multigrid, also known as the DD-$Ξ±$AMG solver, is a successful realization of an algebraic multigrid solver for lattice quantum chromodynamics. Its CPU implementation has made it possible to construct, for some particular discretizations, simulations otherwise computationally unfeasible, and furthermore it has motivated the development and improvement of other algebraic multigrid solvers in the area. From an existing version of DD-$Ξ±$AMG already partially ported via CUDA to run some finest-level operations of the multigrid solver on Nvidia GPUs, we translate the CUDA code here by using HIP to run on the ORISE supercomputer. We moreover extend the smoothers available in DD-$Ξ±$AMG, paying particular attention to Richardson smoothing, which in our numerical experiments has led to a multigrid solver faster than smoothing with GCR and only 10% slower compared to SAP smoothing. Then we port the odd-even-preconditioned versions of GMRES and Richardson via CUDA. Finally, we extend some computationally intensive coarse-grid operations via advanced vectorization.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β hep-lat
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Lattice gauge equivariant convolutional neural networks
R.I.P.
π»
Ghosted
Aspects of scaling and scalability for flow-based sampling of lattice QCD
R.I.P.
π»
Ghosted
Gauge Equivariant Neural Networks for 2+1D U(1) Gauge Theory Simulations in Hamiltonian Formulation
R.I.P.
π»
Ghosted
Job Management and Task Bundling
R.I.P.
π»
Ghosted
Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted