Maximum Entropy competes with Maximum Likelihood

December 17, 2020 · Declared Dead · 🏛 arXiv.org

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors A. E. Allahverdyan, N. H. Martirosyan arXiv ID 2012.09430 Category physics.data-an Cross-listed cond-mat.stat-mech, cs.LG, stat.ML Citations 0 Venue arXiv.org Last Checked 3 months ago

Abstract

Maximum entropy (MAXENT) method has a large number of applications in theoretical and applied machine learning, since it provides a convenient non-parametric tool for estimating unknown probabilities. The method is a major contribution of statistical physics to probabilistic inference. However, a systematic approach towards its validity limits is currently missing. Here we study MAXENT in a Bayesian decision theory set-up, i.e. assuming that there exists a well-defined prior Dirichlet density for unknown probabilities, and that the average Kullback-Leibler (KL) distance can be employed for deciding on the quality and applicability of various estimators. These allow to evaluate the relevance of various MAXENT constraints, check its general applicability, and compare MAXENT with estimators having various degrees of dependence on the prior, viz. the regularized maximum likelihood (ML) and the Bayesian estimators. We show that MAXENT applies in sparse data regimes, but needs specific types of prior information. In particular, MAXENT can outperform the optimally regularized ML provided that there are prior rank correlations between the estimated random quantity and its probabilities.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — physics.data-an

R.I.P. 👻 Ghosted

ROOT - A C++ Framework for Petabyte Data Storage, Statistical Analysis and Visualization

Ilka Antcheva, Maarten Ballintijn, ... (+25 more)

physics.data-an 🏛 Computer Physics Communications 📚 716 cites 10 years ago

R.I.P. 👻 Ghosted

A deep convolutional neural network approach to single-particle recognition in cryo-electron microscopy

Yanan Zhu, Qi Ouyang, Youdong Mao

physics.data-an 🏛 BMC Bioinformatics 📚 136 cites 10 years ago

R.I.P. 👻 Ghosted

The Pandora Software Development Kit for Pattern Recognition

J. S. Marshall, M. A. Thomson

physics.data-an 🏛 The European Physical Journal C 📚 128 cites 11 years ago

R.I.P. 👻 Ghosted

Emergence of Compositional Representations in Restricted Boltzmann Machines

Jérôme Tubiana, Rémi Monasson

physics.data-an 🏛 Phys. Rev. Lett. 📚 99 cites 9 years ago

R.I.P. 👻 Ghosted

Investigating echo state networks dynamics by means of recurrence analysis

Filippo Maria Bianchi, Lorenzo Livi, Cesare Alippi

physics.data-an 🏛 IEEE TNNLS 📚 93 cites 10 years ago

R.I.P. 👻 Ghosted

Discovering state-parameter mappings in subsurface models using generative adversarial networks

Alexander Y. Sun

physics.data-an 🏛 Geophysical Research Letters 📚 85 cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago