A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

October 02, 2015 · The Cartographer · 🏛 arXiv.org

📚 THE CARTOGRAPHER: The Cartographer
Survey/review paper: maps the landscape rather than implementing a method.

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit"

Evidence collected by the PWNC Scanner

Authors: Giuseppe Burtini, Jason Loeppky, Ramon Lawrence
arXiv ID: 1510.00757
Category: stat.ML, Machine Learning (Stat)
Cross-listed: cs.LG
Citations: 125
Venue: arXiv.org
Last Checked: 1 day ago
Abstract
Adaptive and sequential experiment design is a well-studied area in numerous domains. We survey and synthesize the work of the online statistical learning paradigm referred to as multi-armed bandits, integrating the existing research as a resource for a certain class of online experiments. We first explore the traditional stochastic model of a multi-armed bandit, then explore a taxonomic scheme of complications to that model, relating each complication to a specific requirement or consideration of the experiment design context. Finally, at the end of the paper, we present a table of known upper bounds of regret for all studied algorithms, providing both perspectives for future theoretical work and a decision-making tool for practitioners looking for theoretical guarantees.
Community shame:
Not yet rated

📜 Similar Papers

In the same crypt: Machine Learning (Stat)

🏛️ 🏛️ Transcended

Layer Normalization

Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton

stat.ML · 🏛 arXiv · 📚 12.0K cites · 9 years ago