Convergence Analysis of Gradient EM for Multi-component Gaussian Mixture

May 23, 2017 · Declared Dead · 🏛 Neural Information Processing Systems

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Bowei Yan, Mingzhang Yin, Purnamrita Sarkar
arXiv ID: 1705.08530
Category: math.ST
Cross-listed: cs.LG
Citations: 1
Venue: Neural Information Processing Systems
Last Checked: 4 months ago

Abstract

In this paper, we study the convergence properties of the gradient Expectation-Maximization (EM) algorithm (Lange, 1995) for Gaussian Mixture Models (GMMs) with a general number of clusters and general mixing coefficients. We derive a convergence rate that depends on the mixing coefficients, the minimum and maximum pairwise distances between the true centers, the dimensionality, and the number of components, and we obtain a near-optimal local contraction radius. While some notable recent works derive local convergence rates for EM in the symmetric GMM with two equally weighted components, the more general case requires structurally different and non-trivial arguments. We use recent tools from learning theory and empirical processes to obtain our theoretical results.
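
For readers unfamiliar with gradient EM, the following is a minimal sketch of the kind of update the abstract refers to. It is not the authors' code, and it rests on simplifying assumptions that the abstract does not state: identity covariance for every component, known and fixed mixing weights, and updates applied to the centers only. Each iteration computes the posterior responsibilities at the current centers and then takes a single gradient step on the expected complete-data log-likelihood instead of a full M-step. All function names and the step size are illustrative.

```python
import math
import random


def responsibilities(x, centers, weights):
    """Posterior probability of each component for point x (identity covariance assumed)."""
    # Log of weight * N(x; mu, I), dropping the normalising constant shared by all components.
    logs = [
        math.log(w) - 0.5 * sum((xi - mi) ** 2 for xi, mi in zip(x, mu))
        for w, mu in zip(weights, centers)
    ]
    top = max(logs)  # subtract the max before exponentiating, for numerical stability
    exps = [math.exp(v - top) for v in logs]
    total = sum(exps)
    return [e / total for e in exps]


def gradient_em_step(data, centers, weights, step):
    """One gradient EM update of the centers; the mixing weights stay fixed (assumption)."""
    n, d = len(data), len(data[0])
    grads = [[0.0] * d for _ in centers]
    for x in data:
        resp = responsibilities(x, centers, weights)
        for i, mu in enumerate(centers):
            for k in range(d):
                # Gradient of the sample Q-function with respect to center i.
                grads[i][k] += resp[i] * (x[k] - mu[k]) / n
    return [
        [mu[k] + step * g[k] for k in range(d)]
        for mu, g in zip(centers, grads)
    ]


def gradient_em(data, init_centers, weights, step=1.0, iters=100):
    """Run a fixed number of gradient EM iterations from the given initial centers."""
    centers = [list(mu) for mu in init_centers]
    for _ in range(iters):
        centers = gradient_em_step(data, centers, weights, step)
    return centers


def sample_gmm(n, centers, weights, rng):
    """Draw n points from an identity-covariance Gaussian mixture."""
    data = []
    for _ in range(n):
        mu = rng.choices(centers, weights=weights)[0]
        data.append([rng.gauss(m, 1.0) for m in mu])
    return data


if __name__ == "__main__":
    rng = random.Random(0)
    true_centers = [[0.0, 0.0], [6.0, 0.0], [0.0, 6.0]]  # illustrative values
    weights = [0.5, 0.3, 0.2]
    data = sample_gmm(600, true_centers, weights, rng)
    # Start close to the truth, mimicking a local (not global) convergence setting.
    init = [[m + 0.5 for m in mu] for mu in true_centers]
    estimate = gradient_em(data, init, weights)
    for mu, hat in zip(true_centers, estimate):
        print(mu, [round(v, 2) for v in hat])
```

The initialization near the true centers reflects the local nature of the guarantee described in the abstract; the sketch makes no claim about behavior from arbitrary starting points.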

📜 Similar Papers

In the same crypt — math.ST

R.I.P. 👻 Ghosted

Nonparametric regression using deep neural networks with ReLU activation function

Johannes Schmidt-Hieber

math.ST 🏛 Annals of Statistics 📚 949 cites 8 years ago

R.I.P. 👻 Ghosted

An introduction to Topological Data Analysis: fundamental and practical aspects for data scientists

Frédéric Chazal, Bertrand Michel

math.ST 🏛 AI 📚 727 cites 8 years ago

R.I.P. 👻 Ghosted

Minimax Optimal Procedures for Locally Private Estimation

John Duchi, Martin Wainwright, Michael Jordan

math.ST 🏛 arXiv 📚 481 cites 10 years ago

R.I.P. 👻 Ghosted

Optimal Best Arm Identification with Fixed Confidence

Aurélien Garivier, Emilie Kaufmann

math.ST 🏛 COLT 📚 384 cites 10 years ago

R.I.P. 👻 Ghosted

Fast low-rank estimation by projected gradient descent: General statistical and algorithmic guarantees

Yudong Chen, Martin J. Wainwright

math.ST 🏛 arXiv 📚 329 cites 10 years ago

R.I.P. 👻 Ghosted

User-friendly guarantees for the Langevin Monte Carlo with inaccurate gradient

Arnak S. Dalalyan, Avetik G. Karagulyan

math.ST 🏛 Stochastic Processes and their Applications 📚 319 cites 8 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago