The Ethereal
Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling
April 26, 2022 · Entered Twilight · IEEE Congress on Evolutionary Computation
Repo contents: .github, .gitignore, .readthedocs.yml, .travis.yml, LICENSE, MANIFEST.in, README.md, banditzoo, coverage.sh, coverage.svg, docs, examples, head, requirements.txt, setup.cfg, setup.py, tests
Authors
Baihan Lin
arXiv ID
2205.10113
Category
cs.NE: Neural & Evolutionary
Cross-listed
cs.AI, cs.LG, cs.MA, stat.ML
Citations
6
Venue
IEEE Congress on Evolutionary Computation
Repository
https://github.com/doerlbh/BanditZoo
⭐ 7
Last Checked
3 months ago
Abstract
As two popular schools of machine learning, online learning and evolutionary computation have become important driving forces behind real-world decision-making engines for applications in biomedicine, economics, and engineering. Although prior work has used bandits to improve the optimization process of evolutionary algorithms, it remains an open question how evolutionary approaches can improve the sequential decision-making of online learning agents such as multi-armed bandits. In this work, we propose Genetic Thompson Sampling, a bandit algorithm that maintains a population of agents and updates them with genetic principles such as elite selection, crossover, and mutation. Empirical results in multi-armed bandit simulation environments and a practical epidemic control problem suggest that by incorporating the genetic algorithm into the bandit algorithm, our method significantly outperforms the baselines in nonstationary settings. Lastly, we introduce EvoBandit, a web-based interactive visualization that guides readers through the entire learning process and performs lightweight evaluations on the fly. We hope this investigation will engage researchers in this growing field of research.
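The abstract names the ingredients (a population of Thompson sampling agents, elite selection, crossover, mutation) without giving details, so the following is only a minimal sketch of how they could fit together on a Bernoulli bandit. It is not the paper's algorithm or the BanditZoo API. All names (`TSAgent`, `crossover`, `mutate`, `evolve`, `run`) and all parameter values (`elite_frac`, `rate`, `scale`, `evolve_every`) are hypothetical, and the sketch assumes each agent pulls its own arm every round and is ranked by the reward it has collected since the last evolution step.

```python
import random


class TSAgent:
    """Beta-Bernoulli Thompson sampling agent (illustrative, not from BanditZoo)."""

    def __init__(self, n_arms, alpha=None, beta=None):
        # Beta(1, 1) priors unless parameters are inherited from parents.
        self.alpha = list(alpha) if alpha is not None else [1.0] * n_arms
        self.beta = list(beta) if beta is not None else [1.0] * n_arms
        self.fitness = 0.0  # reward collected since the last evolution step

    def act(self, rng):
        # Sample one plausible mean per arm and play the best sample.
        samples = [rng.betavariate(a, b) for a, b in zip(self.alpha, self.beta)]
        return max(range(len(samples)), key=samples.__getitem__)

    def update(self, arm, reward):
        self.alpha[arm] += reward
        self.beta[arm] += 1 - reward
        self.fitness += reward


def crossover(p1, p2, rng):
    # Assumption: uniform per-arm crossover of the parents' Beta parameters.
    alpha, beta = [], []
    for k in range(len(p1.alpha)):
        src = p1 if rng.random() < 0.5 else p2
        alpha.append(src.alpha[k])
        beta.append(src.beta[k])
    return TSAgent(len(alpha), alpha, beta)


def mutate(agent, rng, rate=0.1, scale=0.5):
    # Assumption: Gaussian perturbation, clipped so parameters stay positive.
    for k in range(len(agent.alpha)):
        if rng.random() < rate:
            agent.alpha[k] = max(1e-3, agent.alpha[k] + rng.gauss(0, scale))
            agent.beta[k] = max(1e-3, agent.beta[k] + rng.gauss(0, scale))
    return agent


def evolve(population, rng, elite_frac=0.5):
    # Elite selection: keep the top agents, refill the rest with offspring.
    ranked = sorted(population, key=lambda a: a.fitness, reverse=True)
    n_elite = max(2, int(len(ranked) * elite_frac))
    elites = ranked[:n_elite]
    children = []
    for _ in range(len(population) - n_elite):
        p1, p2 = rng.sample(elites, 2)
        children.append(mutate(crossover(p1, p2, rng), rng))
    for agent in elites:
        agent.fitness = 0.0  # start the next generation on equal footing
    return elites + children


def run(probs, horizon=2000, pop_size=8, evolve_every=50, seed=0):
    """Return the average per-pull reward of the whole population."""
    rng = random.Random(seed)
    population = [TSAgent(len(probs)) for _ in range(pop_size)]
    total = 0
    for t in range(1, horizon + 1):
        for agent in population:
            arm = agent.act(rng)
            reward = 1 if rng.random() < probs[arm] else 0
            agent.update(arm, reward)
            total += reward
        if t % evolve_every == 0:
            population = evolve(population, rng)
    return total / (horizon * pop_size)
```

Calling `run([0.2, 0.8])` simulates a stationary two-armed bandit. To probe the nonstationary setting the abstract highlights, one could change `probs` partway through the loop, which this sketch does not do.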
Similar Papers
In the same crypt: Neural & Evolutionary
R.I.P.
👻 Ghosted
Deep Learning using Rectified Linear Units (ReLU)
R.I.P.
👻 Ghosted
Generative Adversarial Text to Image Synthesis
R.I.P.
👻 Ghosted
Regularized Evolution for Image Classifier Architecture Search
R.I.P.
👻 Ghosted
Temporal Ensembling for Semi-Supervised Learning
Old Age