Leveraging Fully Observable Policies for Learning under Partial Observability

November 03, 2022 · Declared Dead · 🏛 Conference on Robot Learning

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors: Hai Nguyen, Andrea Baisero, Dian Wang, Christopher Amato, Robert Platt
arXiv ID: 2211.01991
Category: cs.RO (Robotics)
Cross-listed: cs.LG
Citations: 28
Venue: Conference on Robot Learning
Last Checked: 4 months ago

Abstract

Reinforcement learning in partially observable domains is challenging due to the lack of observable state information. Thankfully, learning offline in a simulator with such state information is often possible. In particular, we propose a method for partially observable reinforcement learning that uses a fully observable policy (which we call a state expert) during offline training to improve online performance. Based on Soft Actor-Critic (SAC), our agent balances performing actions similar to the state expert and getting high returns under partial observability. Our approach can leverage the fully observable policy for exploration and in parts of the domain that are fully observable while still being able to learn under partial observability. On six robotics domains, our method outperforms pure imitation, pure reinforcement learning, the sequential or parallel combination of both types, and a recent state-of-the-art method in the same setting. A successful policy transfer to a physical robot in a manipulation task from pixels shows our approach's practicality in learning interesting policies under partial observability.
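
The abstract describes an agent that balances acting like the state expert against getting high returns, but it does not give the objective. As an illustration only, the sketch below uses a standard KL-regularized objective over a small discrete action set as a stand-in for that trade-off. The function names, the weight `alpha`, the discrete setting, and the example numbers are all assumptions made here and are not taken from the paper.

```python
import math

def kl_regularized_policy(q_values, expert_probs, alpha):
    """Policy maximizing  sum_a pi(a) * Q(a) - alpha * KL(pi || expert).

    The maximizer has the closed form pi(a) proportional to
    expert(a) * exp(Q(a) / alpha). Assumes every expert probability
    is strictly positive (hypothetical toy setting, not the paper's).
    """
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    # Work in log space and subtract the max for numerical stability.
    logits = [math.log(p) + q / alpha for p, q in zip(expert_probs, q_values)]
    top = max(logits)
    weights = [math.exp(l - top) for l in logits]
    total = sum(weights)
    return [w / total for w in weights]

def objective(policy, q_values, expert_probs, alpha):
    """Expected Q-value minus alpha times KL(policy || expert)."""
    expected_q = sum(p * q for p, q in zip(policy, q_values))
    kl = sum(p * math.log(p / e)
             for p, e in zip(policy, expert_probs) if p > 0)
    return expected_q - alpha * kl

# Toy numbers: the expert prefers action 0, the return estimate prefers action 2.
q = [1.0, 0.0, 2.0]
expert = [0.7, 0.2, 0.1]

close_to_expert = kl_regularized_policy(q, expert, alpha=1000.0)
close_to_greedy = kl_regularized_policy(q, expert, alpha=0.05)
```

With a large `alpha` the resulting policy stays close to the expert's action distribution, and with a small `alpha` it concentrates on the action with the highest estimated return, which is the kind of balance the abstract refers to.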

📜 Similar Papers

In the same crypt — Robotics

R.I.P. 👻 Ghosted

Past, Present, and Future of Simultaneous Localization And Mapping: Towards the Robust-Perception Age

Cesar Cadena, Luca Carlone, ... (+6 more)

cs.RO 🏛 IEEE TRO 📚 3.2K cites 10 years ago

R.I.P. 👻 Ghosted

AirSim: High-Fidelity Visual and Physical Simulation for Autonomous Vehicles

Shital Shah, Debadeepta Dey, ... (+2 more)

cs.RO 🏛 ICFSR 📚 2.3K cites 9 years ago

📚 📚 The Cartographer

A Survey of Motion Planning and Control Techniques for Self-driving Urban Vehicles

Brian Paden, Michal Cap, ... (+3 more)

cs.RO 🏛 IEEE TIV 📚 2.3K cites 10 years ago

📚 📚 The Cartographer

Unmanned Aerial Vehicles: A Survey on Civil Applications and Key Research Challenges

Hazim Shakhatreh, Ahmad Sawalmeh, ... (+7 more)

cs.RO 🏛 arXiv 📚 1.8K cites 8 years ago

📚 📚 The Cartographer

A Survey of Autonomous Driving: Common Practices and Emerging Technologies

Ekim Yurtsever, Jacob Lambert, ... (+2 more)

cs.RO 🏛 IEEE Access 📚 1.7K cites 7 years ago

R.I.P. 👻 Ghosted

Learning agile and dynamic motor skills for legged robots

Jemin Hwangbo, Joonho Lee, ... (+5 more)

cs.RO 🏛 Sci. Robot. 📚 1.6K cites 7 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 9 years ago