A Near-optimal Algorithm for Learning Margin Halfspaces with Massart Noise
January 16, 2025 ยท Declared Dead ยท ๐ Neural Information Processing Systems
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Ilias Diakonikolas, Nikos Zarifis
arXiv ID
2501.09691
Category
cs.LG: Machine Learning
Cross-listed
cs.DS,
math.ST,
stat.ML
Citations
4
Venue
Neural Information Processing Systems
Last Checked
4 months ago
Abstract
We study the problem of PAC learning $ฮณ$-margin halfspaces in the presence of Massart noise. Without computational considerations, the sample complexity of this learning problem is known to be $\widetildeฮ(1/(ฮณ^2 ฮต))$. Prior computationally efficient algorithms for the problem incur sample complexity $\tilde{O}(1/(ฮณ^4 ฮต^3))$ and achieve 0-1 error of $ฮท+ฮต$, where $ฮท<1/2$ is the upper bound on the noise rate. Recent work gave evidence of an information-computation tradeoff, suggesting that a quadratic dependence on $1/ฮต$ is required for computationally efficient algorithms. Our main result is a computationally efficient learner with sample complexity $\widetildeฮ(1/(ฮณ^2 ฮต^2))$, nearly matching this lower bound. In addition, our algorithm is simple and practical, relying on online SGD on a carefully selected sequence of convex losses.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
๐ Similar Papers
In the same crypt โ Machine Learning
๐ฎ
๐ฎ
The Ethereal
๐ฎ
๐ฎ
The Ethereal
Continuous control with deep reinforcement learning
๐
๐
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
๐
๐
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
๐
๐
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
๐ฎ
๐ฎ
The Ethereal
Asynchronous Methods for Deep Reinforcement Learning
Died the same way โ ๐ป Ghosted
R.I.P.
๐ป
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
๐ป
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
๐ป
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
๐ป
Ghosted