Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions

April 16, 2020 · Declared Dead · 🏛 Robotics: Science and Systems

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Jason Choi, Fernando Castañeda, Claire J. Tomlin, Koushil Sreenath arXiv ID 2004.07584 Category eess.SY: Systems & Control (EE) Cross-listed cs.LG, cs.RO Citations 217 Venue Robotics: Science and Systems Last Checked 2 months ago

Abstract

In this paper, the issue of model uncertainty in safety-critical control is addressed with a data-driven approach. For this purpose, we utilize the structure of an input-ouput linearization controller based on a nominal model along with a Control Barrier Function and Control Lyapunov Function based Quadratic Program (CBF-CLF-QP). Specifically, we propose a novel reinforcement learning framework which learns the model uncertainty present in the CBF and CLF constraints, as well as other control-affine dynamic constraints in the quadratic program. The trained policy is combined with the nominal model-based CBF-CLF-QP, resulting in the Reinforcement Learning-based CBF-CLF-QP (RL-CBF-CLF-QP), which addresses the problem of model uncertainty in the safety constraints. The performance of the proposed method is validated by testing it on an underactuated nonlinear bipedal robot walking on randomly spaced stepping stones with one step preview, obtaining stable and safe walking under model uncertainty.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — Systems & Control (EE)

R.I.P. 👻 Ghosted

A Tutorial on Modeling and Analysis of Dynamic Social Networks. Part I

Anton V. Proskurnikov, Roberto Tempo

eess.SY 🏛 Annual Reviews in Control 📚 560 cites 9 years ago

R.I.P. 👻 Ghosted

Incremental Gradient, Subgradient, and Proximal Methods for Convex Optimization: A Survey

Dimitri P. Bertsekas

eess.SY 🏛 arXiv 📚 454 cites 10 years ago

R.I.P. 👻 Ghosted

Wireless Network Design for Control Systems: A Survey

Pangun Park, Sinem Coleri Ergen, ... (+3 more)

eess.SY 🏛 IEEE COMST 📚 447 cites 8 years ago

R.I.P. 👻 Ghosted

Learning-based Model Predictive Control for Safe Exploration

Torsten Koller, Felix Berkenkamp, ... (+2 more)

eess.SY 🏛 CDC 📚 412 cites 8 years ago

R.I.P. 👻 Ghosted

Safety-Critical Model Predictive Control with Discrete-Time Control Barrier Function

Jun Zeng, Bike Zhang, Koushil Sreenath

eess.SY 🏛 ACC 📚 388 cites 5 years ago

R.I.P. 👻 Ghosted

Novel Multidimensional Models of Opinion Dynamics in Social Networks

Sergey E. Parsegov, Anton V. Proskurnikov, ... (+2 more)

eess.SY 🏛 IEEE TAC 📚 372 cites 10 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Language Models are Few-Shot Learners

Tom B. Brown, Benjamin Mann, ... (+29 more)

cs.CL 🏛 NeurIPS 📚 54.2K cites 5 years ago

R.I.P. 👻 Ghosted

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, ... (+19 more)

cs.LG 🏛 NeurIPS 📚 49.7K cites 6 years ago

R.I.P. 👻 Ghosted

XGBoost: A Scalable Tree Boosting System

Tianqi Chen, Carlos Guestrin

cs.LG 🏛 KDD 📚 49.2K cites 10 years ago

R.I.P. 👻 Ghosted

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Sergey Ioffe, Christian Szegedy

cs.LG 🏛 ICML 📚 46.0K cites 11 years ago