Stella Nera: A Differentiable Maddness-Based Hardware Accelerator for Efficient Approximate Matrix Multiplication

November 16, 2023 · Declared Dead · 🏛 IEEE Computer Society Annual Symposium on VLSI

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Jannis Schönleber, Lukas Cavigelli, Matteo Perotti, Luca Benini, Renzo Andri arXiv ID 2311.10207 Category cs.AR: Hardware Architecture Cross-listed cs.CV, cs.LG, stat.ML Citations 1 Venue IEEE Computer Society Annual Symposium on VLSI Last Checked 3 months ago

Abstract

Artificial intelligence has surged in recent years, with advancements in machine learning rapidly impacting nearly every area of life. However, the growing complexity of these models has far outpaced advancements in available hardware accelerators, leading to significant computational and energy demands, primarily due to matrix multiplications, which dominate the compute workload. Maddness (i.e., Multiply-ADDitioN-lESS) presents a hash-based version of product quantization, which renders matrix multiplications into lookups and additions, eliminating the need for multipliers entirely. We present Stella Nera, the first Maddness-based accelerator achieving an energy efficiency of 161 TOp/s/W@0.55V, 25x better than conventional MatMul accelerators due to its small components and reduced computational complexity. We further enhance Maddness with a differentiable approximation, allowing for gradient-based fine-tuning and achieving an end-to-end performance of 92.5% Top-1 accuracy on CIFAR-10.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Hardware Architecture

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Corona: System Implications of Emerging Nanophotonic Technology

Dana Vantrease, Robert Schreiber, ... (+8 more)

cs.AR 🏛 ISCA 📚 710 cites 2 years ago

R.I.P. 👻 Ghosted

A scalable multi-core architecture with heterogeneous memory structures for Dynamic Neuromorphic Asynchronous Processors (DYNAPs)

Saber Moradi, Ning Qiao, ... (+2 more)

cs.AR 🏛 IEEE TBCS 📚 544 cites 8 years ago

R.I.P. 👻 Ghosted

SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

Hanrui Wang, Zhekai Zhang, Song Han

cs.AR 🏛 ISCA 📚 503 cites 5 years ago

R.I.P. 👻 Ghosted

Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks

Charles Eckert, Xiaowei Wang, ... (+6 more)

cs.AR 🏛 ISCA 📚 373 cites 8 years ago

R.I.P. 👻 Ghosted

SpArch: Efficient Architecture for Sparse Matrix Multiplication

Zhekai Zhang, Hanrui Wang, ... (+2 more)

cs.AR 🏛 ISCA 📚 274 cites 6 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago

R.I.P. 👻 Ghosted

Equality of Opportunity in Supervised Learning

Moritz Hardt, Eric Price, Nathan Srebro

cs.LG 🏛 NeurIPS 📚 4.9K cites 9 years ago