OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control

November 10, 2024 · Declared Dead · 🏛 2025 IEEE 21st International Conference on Automation Science and Engineering (CASE)

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Rohit Bokade, Xiaoning Jin arXiv ID 2411.06601 Category cs.AI: Artificial Intelligence Cross-listed cs.LG, cs.MA Citations 1 Venue 2025 IEEE 21st International Conference on Automation Science and Engineering (CASE) Last Checked 4 months ago

Abstract

Efficient traffic control (TSC) is essential for urban mobility, but traditional systems struggle to handle the complexity of real-world traffic. Multi-agent Reinforcement Learning (MARL) offers adaptive solutions, but online MARL requires extensive interactions with the environment, making it costly and impractical. Offline MARL mitigates these challenges by using historical traffic data for training but faces significant difficulties with heterogeneous behavior policies in real-world datasets, where mixed-quality data complicates learning. We introduce OffLight, a novel offline MARL framework designed to handle heterogeneous behavior policies in TSC datasets. To improve learning efficiency, OffLight incorporates Importance Sampling (IS) to correct for distributional shifts and Return-Based Prioritized Sampling (RBPS) to focus on high-quality experiences. OffLight utilizes a Gaussian Mixture Variational Graph Autoencoder (GMM-VGAE) to capture the diverse distribution of behavior policies from local observations. Extensive experiments across real-world urban traffic scenarios show that OffLight outperforms existing offline RL methods, achieving up to a 7.8% reduction in average travel time and 11.2% decrease in queue length. Ablation studies confirm the effectiveness of OffLight's components in handling heterogeneous data and improving policy performance. These results highlight OffLight's scalability and potential to improve urban traffic management without the risks of online learning.