A Study of Performance Portability in Plasma Physics Simulations
October 19, 2024 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Josef Ruzicka, Christian Asch, Esteban Meneses, Markus Rampp, Erwin Laure
arXiv ID
2411.05009
Category
physics.plasm-ph
Cross-listed
cs.DC,
cs.PF,
physics.comp-ph
Citations
2
Venue
arXiv.org
Last Checked
3 months ago
Abstract
The high-performance computing (HPC) community has recently seen a substantial diversification of hardware platforms and their associated programming models. From traditional multicore processors to highly specialized accelerators, vendors and tool developers back up the relentless progress of those architectures. In the context of scientific programming, it is fundamental to consider performance portability frameworks, i.e., software tools that allow programmers to write code once and run it on different computer architectures without sacrificing performance. We report here on the benefits and challenges of performance portability using a field-line tracing simulation and a particle-in-cell code, two relevant applications in computational plasma physics with applications to magnetically-confined nuclear-fusion energy research. For these applications we report performance results obtained on four HPC platforms with server-class CPUs from Intel (Xeon) and AMD (EPYC), and high-end GPUs from Nvidia and AMD, including the latest Nvidia H100 GPU and the novel AMD Instinct MI300A APU. Our results show that both Kokkos and OpenMP are powerful tools to achieve performance portability and decent "out-of-the-box" performance, even for the very latest hardware platforms. For our applications, Kokkos provided performance portability to the broadest range of hardware architectures from different vendors.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β physics.plasm-ph
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Plasma Surrogate Modelling using Fourier Neural Operators
R.I.P.
π»
Ghosted
Deep Learning for Plasma Tomography and Disruption Prediction from Bolometer Data
R.I.P.
π»
Ghosted
Machine learning plasma-surface interface for coupling sputtering and gas-phase transport simulations
R.I.P.
π»
Ghosted
Enhancing predictive capabilities in fusion burning plasmas through surrogate-based optimization in core transport solvers
R.I.P.
π»
Ghosted
Extracting Electron Scattering Cross Sections from Swarm Data using Deep Neural Networks
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted