pMR: A high-performance communication library
January 30, 2017 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Peter Georg, Daniel Richtmann, Tilo Wettig
arXiv ID
1701.08521
Category
hep-lat
Cross-listed
cs.DC,
physics.comp-ph
Citations
9
Venue
arXiv.org
Last Checked
3 months ago
Abstract
On many parallel machines, the time LQCD applications spent in communication is a significant contribution to the total wall-clock time, especially in the strong-scaling limit. We present a novel high-performance communication library that can be used as a de facto drop-in replacement for MPI in existing software. Its lightweight nature that avoids some of the unnecessary overhead introduced by MPI allows us to improve the communication performance of applications without any algorithmic or complicated implementation changes. As a first real-world benchmark, we make use of the pMR library in the coarse-grid solve of the Regensburg implementation of the DD-$Ξ±$AMG algorithm. On realistic lattices, we see an improvement of a factor 2x in pure communication time and total execution time savings of up to 20%.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β hep-lat
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Lattice gauge equivariant convolutional neural networks
R.I.P.
π»
Ghosted
Aspects of scaling and scalability for flow-based sampling of lattice QCD
R.I.P.
π»
Ghosted
Gauge Equivariant Neural Networks for 2+1D U(1) Gauge Theory Simulations in Hamiltonian Formulation
R.I.P.
π»
Ghosted
Job Management and Task Bundling
R.I.P.
π»
Ghosted
Simulating the weak death of the neutron in a femtoscale universe with near-Exascale computing
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted