Efficient Online Bandit Multiclass Learning with $\tilde{O}(\sqrt{T})$ Regret

February 25, 2017 ยท Declared Dead ยท ๐Ÿ› International Conference on Machine Learning

๐Ÿ‘ป CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Alina Beygelzimer, Francesco Orabona, Chicheng Zhang arXiv ID 1702.07958 Category cs.LG: Machine Learning Cross-listed stat.ML Citations 21 Venue International Conference on Machine Learning Last Checked 4 months ago
Abstract
We present an efficient second-order algorithm with $\tilde{O}(\frac{1}ฮท\sqrt{T})$ regret for the bandit online multiclass problem. The regret bound holds simultaneously with respect to a family of loss functions parameterized by $ฮท$, for a range of $ฮท$ restricted by the norm of the competitor. The family of loss functions ranges from hinge loss ($ฮท=0$) to squared hinge loss ($ฮท=1$). This provides a solution to the open problem of (J. Abernethy and A. Rakhlin. An efficient bandit algorithm for $\sqrt{T}$-regret in online multiclass prediction? In COLT, 2009). We test our algorithm experimentally, showing that it also performs favorably against earlier algorithms.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Machine Learning

Died the same way โ€” ๐Ÿ‘ป Ghosted