Algorithms and data structures for matrix-free finite element operators with MPI-parallel sparse multi-vectors
July 01, 2019 ยท Declared Dead ยท ๐ ACM Transactions on Parallel Computing
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Denis Davydov, Martin Kronbichler
arXiv ID
1907.01005
Category
cs.MS: Mathematical Software
Cross-listed
cs.DS,
math.NA,
physics.comp-ph
Citations
7
Venue
ACM Transactions on Parallel Computing
Last Checked
2 months ago
Abstract
Traditional solution approaches for problems in quantum mechanics scale as $\mathcal O(M^3)$, where $M$ is the number of electrons. Various methods have been proposed to address this issue and obtain linear scaling $\mathcal O(M)$. One promising formulation is the direct minimization of energy. Such methods take advantage of physical localization of the solution, namely that the solution can be sought in terms of non-orthogonal orbitals with local support. In this work a numerically efficient implementation of sparse parallel vectors within the open-source finite element library deal.II is proposed. The main algorithmic ingredient is the matrix-free evaluation of the Hamiltonian operator by cell-wise quadrature. Based on an a-priori chosen support for each vector we develop algorithms and data structures to perform (i) matrix-free sparse matrix multivector products (SpMM), (ii) the projection of an operator onto a sparse sub-space (inner products), and (iii) post-multiplication of a sparse multivector with a square matrix. The node-level performance is analyzed using a roofline model. Our matrix-free implementation of finite element operators with sparse multivectors achieves the performance of 157 GFlop/s on Intel Cascade Lake architecture. Strong and weak scaling results are reported for a typical benchmark problem using quadratic and quartic finite element bases.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Mathematical Software
๐
๐
Old Age
๐
๐
Old Age
CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication
R.I.P.
๐ป
Ghosted
Mathematical Foundations of the GraphBLAS
R.I.P.
๐ป
Ghosted
The DUNE Framework: Basic Concepts and Recent Developments
R.I.P.
๐ป
Ghosted
Format Abstraction for Sparse Tensor Algebra Compilers
R.I.P.
๐ป
Ghosted
AMReX: Block-Structured Adaptive Mesh Refinement for Multiphysics Applications
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Language Models are Few-Shot Learners
R.I.P.
๐ป
Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P.
๐ป
Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P.
๐ป
Ghosted