A Review of Cooperative Multi-Agent Deep Reinforcement Learning
August 11, 2019 Β· The Cartographer Β· π Applied intelligence (Boston)
"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Review of Cooperative Multi-Agent Deep Reinforcement Learning"
Evidence collected by the PWNC Scanner
Authors
Afshin OroojlooyJadid, Davood Hajinezhad
arXiv ID
1908.03963
Category
cs.LG: Machine Learning
Cross-listed
cs.AI,
cs.MA,
math.OC,
stat.ML
Citations
560
Venue
Applied intelligence (Boston)
Last Checked
1 day ago
Abstract
Deep Reinforcement Learning has made significant progress in multi-agent systems in recent years. In this review article, we have focused on presenting recent approaches on Multi-Agent Reinforcement Learning (MARL) algorithms. In particular, we have focused on five common approaches on modeling and solving cooperative multi-agent reinforcement learning problems: (I) independent learners, (II) fully observable critic, (III) value function factorization, (IV) consensus, and (IV) learn to communicate. First, we elaborate on each of these methods, possible challenges, and how these challenges were mitigated in the relevant papers. If applicable, we further make a connection among different papers in each category. Next, we cover some new emerging research areas in MARL along with the relevant recent papers. Due to the recent success of MARL in real-world applications, we assign a section to provide a review of these applications and corresponding articles. Also, a list of available environments for MARL research is provided in this survey. Finally, the paper is concluded with proposals on the possible research directions.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Machine Learning
ποΈ
ποΈ
Transcended
ποΈ
ποΈ
Transcended
Continuous control with deep reinforcement learning
π
π
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
π
π
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
π
π
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
ποΈ
ποΈ
Transcended