Accurate Estimation of Mutual Information in High Dimensional Data
May 31, 2025 Β· Declared Dead Β· π arXiv.org
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Eslam Abdelaleem, K. Michael Martini, Ilya Nemenman
arXiv ID
2506.00330
Category
physics.data-an
Cross-listed
cs.IT,
stat.ML
Citations
4
Venue
arXiv.org
Last Checked
3 months ago
Abstract
Mutual information (MI) is a fundamental measure of statistical dependence between two variables, yet accurate estimation from finite data remains notoriously difficult. No estimator is universally reliable, and common approaches fail in the high-dimensional, undersampled regimes typical of modern experiments. Recent machine learning-based estimators show promise, but their accuracy depends sensitively on dataset size, structure, and hyperparameters, with no accepted tests to detect failures. We close these gaps through a systematic evaluation of classical and neural MI estimators across standard benchmarks and new synthetic datasets tailored to challenging high-dimensional, undersampled regimes. We contribute: (i) a practical protocol for reliable MI estimation with explicit checks for statistical consistency; (ii) confidence intervals (error bars around estimates) that existing neural MI estimator do not provide; and (iii) a new class of probabilistic critics designed for high-dimensional, high-information settings. We demonstrate the effectiveness of our protocol with computational experiments, showing that it consistently matches or surpasses existing methods while uniquely quantifying its own reliability. We show that reliable MI estimation is sometimes achievable even in severely undersampled, high-dimensional datasets, provided they admit accurate low-dimensional representations. This broadens the scope of applicability of neural MI estimators and clarifies when such estimators can be trusted.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β physics.data-an
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
A deep convolutional neural network approach to single-particle recognition in cryo-electron microscopy
R.I.P.
π»
Ghosted
The Pandora Software Development Kit for Pattern Recognition
R.I.P.
π»
Ghosted
Emergence of Compositional Representations in Restricted Boltzmann Machines
R.I.P.
π»
Ghosted
Investigating echo state networks dynamics by means of recurrence analysis
R.I.P.
π»
Ghosted
Discovering state-parameter mappings in subsurface models using generative adversarial networks
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted