Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applications
December 31, 2018 Β· The Cartographer Β· π IEEE Transactions on Cybernetics
"No code URL or promise found in abstract"
"Title-pattern auto-detect: Deep Reinforcement Learning for Multi-Agent Systems: A Review of Challenges, Solutions and Applicati"
Evidence collected by the PWNC Scanner
Authors
Thanh Thi Nguyen, Ngoc Duy Nguyen, Saeid Nahavandi
arXiv ID
1812.11794
Category
cs.LG: Machine Learning
Cross-listed
cs.AI,
cs.MA,
stat.ML
Citations
955
Venue
IEEE Transactions on Cybernetics
Last Checked
1 day ago
Abstract
Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms however have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This paper addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multi-agent deep RL (MADRL) is presented, including non-stationarity, partial observability, continuous state and action spaces, multi-agent training schemes, multi-agent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed, with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to future development of more robust and highly useful multi-agent learning methods for solving real-world problems.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Machine Learning
ποΈ
ποΈ
Transcended
ποΈ
ποΈ
Transcended
Continuous control with deep reinforcement learning
π
π
Old Age
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
π
π
Old Age
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
π
π
Old Age
SGDR: Stochastic Gradient Descent with Warm Restarts
ποΈ
ποΈ
Transcended