Learning Memory Mechanisms for Decision Making through Demonstrations

November 12, 2024 · Entered Twilight · 🏛 arXiv.org

Repo contents: .gitignore, LICENSE, README.md, environment.yml, ltmb, memory_gym, requirements.txt

Authors William Yue, Bo Liu, Peter Stone arXiv ID 2411.07954 Category cs.LG: Machine Learning Cross-listed cs.RO Citations 4 Venue arXiv.org Repository https://github.com/WilliamYue37/AttentionTuner ⭐ 2 Last Checked 3 months ago

Abstract

In Partially Observable Markov Decision Processes, integrating an agent's history into memory poses a significant challenge for decision-making. Traditional imitation learning, relying on observation-action pairs for expert demonstrations, fails to capture the expert's memory mechanisms used in decision-making. To capture memory processes as demonstrations, we introduce the concept of memory dependency pairs $(p, q)$ indicating that events at time $p$ are recalled for decision-making at time $q$. We introduce AttentionTuner to leverage memory dependency pairs in Transformers and find significant improvements across several tasks compared to standard Transformers when evaluated on Memory Gym and the Long-term Memory Benchmark. Code is available at https://github.com/WilliamYue37/AttentionTuner.