Learning Two-agent Motion Planning Strategies from Generalized Nash Equilibrium for Model Predictive Control
November 21, 2024 Β· Declared Dead Β· π Conference on Learning for Dynamics & Control
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Hansung Kim, Edward L. Zhu, Chang Seok Lim, Francesco Borrelli
arXiv ID
2411.13983
Category
cs.MA: Multiagent Systems
Cross-listed
cs.RO,
eess.SY
Citations
3
Venue
Conference on Learning for Dynamics & Control
Last Checked
3 months ago
Abstract
We introduce an Implicit Game-Theoretic MPC (IGT-MPC), a decentralized algorithm for two-agent motion planning that uses a learned value function that predicts the game-theoretic interaction outcomes as the terminal cost-to-go function in a model predictive control (MPC) framework, guiding agents to implicitly account for interactions with other agents and maximize their reward. This approach applies to competitive and cooperative multi-agent motion planning problems which we formulate as constrained dynamic games. Given a constrained dynamic game, we randomly sample initial conditions and solve for the generalized Nash equilibrium (GNE) to generate a dataset of GNE solutions, computing the reward outcome of each game-theoretic interaction from the GNE. The data is used to train a simple neural network to predict the reward outcome, which we use as the terminal cost-to-go function in an MPC scheme. We showcase emerging competitive and coordinated behaviors using IGT-MPC in scenarios such as two-vehicle head-to-head racing and un-signalized intersection navigation. IGT-MPC offers a novel method integrating machine learning and game-theoretic reasoning into model-based decentralized multi-agent motion planning.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β Multiagent Systems
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Mean Field Multi-Agent Reinforcement Learning
π
π
The Cartographer
A Survey and Critique of Multiagent Deep Reinforcement Learning
π
π
The Cartographer
A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity
π
π
The Cartographer
Collaborative vehicle routing: a survey
R.I.P.
π»
Ghosted
Deep Reinforcement Learning for Swarm Systems
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted