A GPU-Accelerated Fast Summation Method Based on Barycentric Lagrange Interpolation and Dual Tree Traversal
December 13, 2020 ยท Declared Dead ยท ๐ Computer Physics Communications
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Leighton Wilson, Nathan Vaughn, Robert Krasny
arXiv ID
2012.06925
Category
physics.comp-ph
Cross-listed
cs.DC
Citations
16
Venue
Computer Physics Communications
Last Checked
2 months ago
Abstract
We present the barycentric Lagrange dual tree traversal (BLDTT) fast summation method for particle interactions. The scheme replaces well-separated particle-particle interactions by adaptively chosen particle-cluster, cluster-particle, and cluster-cluster approximations given by barycentric Lagrange interpolation at proxy particles on a Chebyshev grid in each cluster. The BLDTT is kernel-independent and the approximations can be efficiently mapped onto GPUs, where target particles provide an outer level of parallelism and source particles provide an inner level of parallelism. We present an OpenACC GPU implementation of the BLDTT with MPI remote memory access for distributed memory parallelization. The performance of the GPU-accelerated BLDTT is demonstrated for calculations with different problem sizes, particle distributions, geometric domains, and interaction kernels, as well as for unequal target and source particles. Comparison with our earlier particle-cluster barycentric Lagrange treecode (BLTC) demonstrates the superior performance of the BLDTT. In particular, on a single GPU for problem sizes ranging from $N$=1E5 to 1E8, the BLTC has $O(N\log N)$ scaling, while the BLDTT has $O(N)$ scaling. In addition, MPI strong scaling results are presented for the BLTC and BLDTT using $N$=64E6 particles on up to 32 GPUs.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ physics.comp-ph
R.I.P.
๐ป
Ghosted
R.I.P.
๐ป
Ghosted
Deep Potential Molecular Dynamics: a scalable model with the accuracy of quantum mechanics
R.I.P.
๐ป
Ghosted
Heterogeneous Parallelization and Acceleration of Molecular Dynamics Simulations in GROMACS
R.I.P.
๐ป
Ghosted
By-passing the Kohn-Sham equations with machine learning
R.I.P.
๐ป
Ghosted
Machine Learning of coarse-grained Molecular Dynamics Force Fields
R.I.P.
๐ป
Ghosted
Towards Physics-informed Deep Learning for Turbulent Flow Prediction
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Language Models are Few-Shot Learners
R.I.P.
๐ป
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted