Information theoretic limits of learning a sparse rule

June 19, 2020 · Declared Dead · 🏛 Neural Information Processing Systems

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Clément Luneau, Jean Barbier, Nicolas Macris arXiv ID 2006.11313 Category cs.IT: Information Theory Cross-listed cs.LG, stat.ML Citations 12 Venue Neural Information Processing Systems Last Checked 4 months ago

Abstract

We consider generalized linear models in regimes where the number of nonzero components of the signal and accessible data points are sublinear with respect to the size of the signal. We prove a variational formula for the asymptotic mutual information per sample when the system size grows to infinity. This result allows us to derive an expression for the minimum mean-square error (MMSE) of the Bayesian estimator when the signal entries have a discrete distribution with finite support. We find that, for such signals and suitable vanishing scalings of the sparsity and sampling rate, the MMSE is nonincreasing piecewise constant. In specific instances the MMSE even displays an all-or-nothing phase transition, that is, the MMSE sharply jumps from its maximum value to zero at a critical sampling rate. The all-or-nothing phenomenon has previously been shown to occur in high-dimensional linear regression. Our analysis goes beyond the linear case and applies to learning the weights of a perceptron with general activation function in a teacher-student scenario. In particular, we discuss an all-or-nothing phenomenon for the generalization error with a sublinear set of training examples.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Information Theory

R.I.P. 👻 Ghosted

Intelligent Reflecting Surface Enhanced Wireless Network via Joint Active and Passive Beamforming

Qingqing Wu, Rui Zhang

cs.IT 🏛 IEEE TWC 📚 3.8K cites 7 years ago

R.I.P. 👻 Ghosted

A Vision of 6G Wireless Systems: Applications, Trends, Technologies, and Open Research Problems

Walid Saad, Mehdi Bennis, Mingzhe Chen

cs.IT 🏛 Network 📚 3.8K cites 7 years ago

R.I.P. 👻 Ghosted

Towards Smart and Reconfigurable Environment: Intelligent Reflecting Surface Aided Wireless Network

Qingqing Wu, Rui Zhang

cs.IT 🏛 IEEE CommMag 📚 3.6K cites 7 years ago

📚 📚 The Cartographer

Wireless Communications with Unmanned Aerial Vehicles: Opportunities and Challenges

Yong Zeng, Rui Zhang, Teng Joon Lim

cs.IT 🏛 IEEE CommMag 📚 3.4K cites 10 years ago

R.I.P. 👻 Ghosted

Reconfigurable Intelligent Surfaces for Energy Efficiency in Wireless Communication

Chongwen Huang, Alessio Zappone, ... (+3 more)

cs.IT 🏛 IEEE TWC 📚 2.7K cites 7 years ago

📚 📚 The Cartographer

An Overview of Signal Processing Techniques for Millimeter Wave MIMO Systems

Robert W. Heath, Nuria Gonzalez-Prelcic, ... (+3 more)

cs.IT 🏛 IEEE JSTSP 📚 2.6K cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago