Generalization Bounds for Uniformly Stable Algorithms
December 24, 2018 ยท Declared Dead ยท ๐ Neural Information Processing Systems
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Vitaly Feldman, Jan Vondrak
arXiv ID
1812.09859
Category
cs.LG: Machine Learning
Cross-listed
cs.DS,
stat.ML
Citations
96
Venue
Neural Information Processing Systems
Last Checked
3 months ago
Abstract
Uniform stability of a learning algorithm is a classical notion of algorithmic stability introduced to derive high-probability bounds on the generalization error (Bousquet and Elisseeff, 2002). Specifically, for a loss function with range bounded in $[0,1]$, the generalization error of a $ฮณ$-uniformly stable learning algorithm on $n$ samples is known to be within $O((ฮณ+1/n) \sqrt{n \log(1/ฮด)})$ of the empirical error with probability at least $1-ฮด$. Unfortunately, this bound does not lead to meaningful generalization bounds in many common settings where $ฮณ\geq 1/\sqrt{n}$. At the same time the bound is known to be tight only when $ฮณ= O(1/n)$. We substantially improve generalization bounds for uniformly stable algorithms without making any additional assumptions. First, we show that the bound in this setting is $O(\sqrt{(ฮณ+ 1/n) \log(1/ฮด)})$ with probability at least $1-ฮด$. In addition, we prove a tight bound of $O(ฮณ^2 + 1/n)$ on the second moment of the estimation error. The best previous bound on the second moment is $O(ฮณ+ 1/n)$. Our proofs are based on new analysis techniques and our results imply substantially stronger generalization guarantees for several well-studied algorithms.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal
Asynchronous Methods for Deep Reinforcement Learning
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
๐ป
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
๐ป
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
๐ป
Ghosted