Fast multiplication of random dense matrices with fixed sparse matrices

October 24, 2023 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Tianyu Liang, Riley Murray, Aydın Buluç, James Demmel
arXiv ID: 2310.15419
Category: cs.CE (Computational Engineering)
Cross-listed: cs.DC, cs.DS
Citations: 0
Venue: arXiv.org
Last Checked: 2 months ago

Abstract

This work focuses on accelerating the multiplication of a dense random matrix with a (fixed) sparse matrix, which is frequently used in sketching algorithms. We develop a novel scheme that takes advantage of blocking and recomputation (on-the-fly random number generation) to accelerate this operation. The techniques we propose decrease memory movement, thereby increasing the algorithm's parallel scalability in shared memory architectures. On the Intel Frontera architecture, our algorithm can achieve 2x speedups over libraries such as Eigen and Intel MKL on some examples. In addition, with 32 threads, we can obtain a parallel efficiency of up to approximately 45%. We also present a theoretical analysis for the memory movement lower bound of our algorithm, showing that under mild assumptions, it's possible to beat the data movement lower bound of general matrix-matrix multiply (GEMM) by a factor of $\sqrt M$, where $M$ is the cache size. Finally, we incorporate our sketching algorithm into a randomized least squares solver. For extremely over-determined sparse input matrices, we show that our results are competitive with SuiteSparse; in some cases, we obtain a speedup of 10x over SuiteSparse.
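The blocking-plus-recomputation idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the CSR layout, the uniform distribution, the string-seeded `random.Random` generator (standing in for a fast counter-based RNG), and the names `sketch_entries`, `sketch_times_sparse`, and `dense_sketch` are all assumptions made for illustration. The point it shows is that the dense random matrix S is never stored; each needed slice of a column of S is regenerated from a seed when a nonzero of the sparse matrix A requires it, one row block of the output at a time.

```python
import random


def sketch_entries(seed, row_block, col, count):
    # Regenerate `count` entries of column `col` of S for one row block.
    # The same (seed, row_block, col) always yields the same values,
    # so S never has to be kept in memory.
    rng = random.Random(f"{seed}:{row_block}:{col}")
    return [rng.uniform(-1.0, 1.0) for _ in range(count)]


def sketch_times_sparse(d, csr, n, seed, block=4):
    """Compute Y = S @ A, with S (d x m) random and A (m x n) in CSR form."""
    indptr, indices, data = csr
    m = len(indptr) - 1
    Y = [[0.0] * n for _ in range(d)]
    # Outer loop over row blocks of S (and of Y): only a block of Y is
    # updated at a time, which is the blocking part of the idea.
    for b, r0 in enumerate(range(0, d, block)):
        rows = min(block, d - r0)
        for i in range(m):  # row i of A pairs with column i of S
            start, end = indptr[i], indptr[i + 1]
            if start == end:
                continue  # empty row of A: no random numbers needed
            s = sketch_entries(seed, b, i, rows)  # recomputation step
            for k in range(start, end):
                j, a = indices[k], data[k]
                for r in range(rows):
                    Y[r0 + r][j] += s[r] * a
    return Y


def dense_sketch(d, m, seed, block=4):
    # Materialise S explicitly; used only to check the on-the-fly version.
    S = [[0.0] * m for _ in range(d)]
    for b, r0 in enumerate(range(0, d, block)):
        rows = min(block, d - r0)
        for i in range(m):
            for r, v in enumerate(sketch_entries(seed, b, i, rows)):
                S[r0 + r][i] = v
    return S
```

Calling `sketch_times_sparse(5, (indptr, indices, data), n, seed=7, block=2)` should agree, up to rounding, with multiplying the matrix from `dense_sketch(5, m, seed=7, block=2)` by a dense copy of A. A production version would presumably use a vectorised generator and tune the block size to the cache, which this sketch does not attempt.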

📜 Similar Papers

In the same crypt — Computational Engineering

R.I.P. 👻 Ghosted

Temporal Relational Ranking for Stock Prediction

Fuli Feng, Xiangnan He, ... (+4 more)

cs.CE 🏛 ACM TOIS 📚 485 cites 7 years ago

R.I.P. 👻 Ghosted

A Probabilistic Graphical Model Foundation for Enabling Predictive Digital Twins at Scale

Michael G. Kapteyn, Jacob V. R. Pretorius, Karen E. Willcox

cs.CE 🏛 Nature Computational Science 📚 277 cites 5 years ago

R.I.P. 👻 Ghosted

Temporal Attention augmented Bilinear Network for Financial Time-Series Data Analysis

Dat Thanh Tran, Alexandros Iosifidis, ... (+2 more)

cs.CE 🏛 IEEE TNNLS 📚 222 cites 8 years ago

R.I.P. 👻 Ghosted

Linked Component Analysis from Matrices to High Order Tensors: Applications to Biomedical Data

Guoxu Zhou, Qibin Zhao, ... (+4 more)

cs.CE 🏛 Proc. IEEE 📚 190 cites 10 years ago

R.I.P. 👻 Ghosted

Deep Dynamical Modeling and Control of Unsteady Fluid Flows

Jeremy Morton, Freddie D. Witherden, ... (+2 more)

cs.CE 🏛 NeurIPS 📚 181 cites 7 years ago

R.I.P. 👻 Ghosted

Design and Optimization of Conforming Lattice Structures

Jun Wu, Weiming Wang, Xifeng Gao

cs.CE 🏛 IEEE TVCG 📚 158 cites 6 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago