A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit
October 02, 2015 Β· The Cartographer Β· π arXiv.org
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit"
Evidence collected by the PWNC Scanner
Authors
Giuseppe Burtini, Jason Loeppky, Ramon Lawrence
arXiv ID
1510.00757
Category
stat.ML: Machine Learning (Stat)
Cross-listed
cs.LG
Citations
125
Venue
arXiv.org
Last Checked
1 day ago
Abstract
Adaptive and sequential experiment design is a well-studied area in numerous domains. We survey and synthesize the work of the online statistical learning paradigm referred to as multi-armed bandits integrating the existing research as a resource for a certain class of online experiments. We first explore the traditional stochastic model of a multi-armed bandit, then explore a taxonomic scheme of complications to that model, for each complication relating it to a specific requirement or consideration of the experiment design context. Finally, at the end of the paper, we present a table of known upper-bounds of regret for all studied algorithms providing both perspectives for future theoretical work and a decision-making tool for practitioners looking for theoretical guarantees.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Machine Learning (Stat)
ποΈ
ποΈ
Transcended
ποΈ
ποΈ
Transcended
Layer Normalization
ποΈ
ποΈ
Transcended
Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
R.I.P.
π»
Ghosted
Variational Inference with Normalizing Flows
π
π
The Cartographer
Towards A Rigorous Science of Interpretable Machine Learning
R.I.P.
π»
Ghosted