Recursively Feasible Probabilistic Safe Online Learning with Control Barrier Functions
Abstract: Learning-based control has recently shown great efficacy in performing complex tasks for various applications. However, to deploy it in real systems, it is of vital importance to guarantee the system will stay safe. Control Barrier Functions (CBFs) offer mathematical tools for designing safety-preserving controllers for systems with known dynamics. In this article, we first introduce a model-uncertainty-aware reformulation of CBF-based safety-critical controllers using Gaussian Process (GP) regression to close the gap between an approximate mathematical model and the real system, which results in a second-order cone program (SOCP)-based control design. We then present the pointwise feasibility conditions of the resulting safety controller, highlighting the level of richness that the available system information must meet to ensure safety. We use these conditions to devise an event-triggered online data collection strategy that ensures the recursive feasibility of the learned safety controller. Our method works by constantly reasoning about whether the current information is sufficient to ensure safety or if new measurements under active safe exploration are required to reduce the uncertainty. As a result, our proposed framework can guarantee the forward invariance of the safe set defined by the CBF with high probability, even if it contains a priori unexplored regions. We validate the proposed framework in two numerical simulation experiments.
- A.Ā D. Ames, X.Ā Xu, J.Ā W. Grizzle, and P.Ā Tabuada, āControl barrier function based quadratic programs for safety critical systems,ā IEEE Transactions on Automatic Control, vol.Ā 62, pp. 3861ā3876, 2017.
- S.Ā Bansal, M.Ā Chen, S.Ā Herbert, and C.Ā J. Tomlin, āHamilton-jacobi reachability: A brief overview and recent advances,ā in 2017 56th IEEE Conference on Decision and Control (CDC), 2017.
- K.Ā P. Wabersich and M.Ā N. Zeilinger, āA predictive safety filter for learning-based control of constrained nonlinear dynamical systems,ā Automatica, vol. 129, p. 109597, 2021.
- Q.Ā Nguyen and K.Ā Sreenath, āL1 adaptive control for bipedal robots with control lyapunov function based quadratic programs,ā in American Control Conference, Chicago, IL, July 2015, pp. 862ā867.
- A.Ā J. Taylor and A.Ā D. Ames, āAdaptive safety with control barrier functions,ā in American Control Conference, 2020, pp. 1399ā1405.
- B.Ā T. Lopez, J.-J.Ā E. Slotine, and J.Ā P. How, āRobust adaptive control barrier functions: An adaptive and data-driven approach to safety,ā IEEE Control Systems Letters, vol.Ā 5, no.Ā 3, pp. 1031ā1036, 2020.
- M.Ā Black and D.Ā Panagou, āAdaptation for validation of a consolidated control barrier function based control synthesis,ā arXiv preprint arXiv:2209.08170, 2022.
- Q.Ā Nguyen and K.Ā Sreenath, āRobust safety-critical control for dynamic robotics,ā IEEE Transactions on Automatic Control, 2021.
- J.Ā J. Choi, D.Ā Lee, K.Ā Sreenath, C.Ā J. Tomlin, and S.Ā L. Herbert, āRobust control barrierāvalue functions for safety-critical control,ā in 2021 60th IEEE Conference on Decision and Control (CDC).Ā Ā Ā IEEE, 2021, pp. 6814ā6821.
- S.Ā Kolathaya and A.Ā D. Ames, āInput-to-state safety with control barrier functions,ā IEEE control systems letters, vol.Ā 3, no.Ā 1, pp. 108ā113, 2018.
- A.Ā Alan, A.Ā J. Taylor, C.Ā R. He, A.Ā D. Ames, and G.Ā Orosz, āControl barrier functions and input-to-state safety with application to automated vehicles,ā IEEE Transactions on Control Systems Technology, 2023.
- M.Ā Krstic, āInverse optimal safety filters,ā arXiv preprint arXiv:2112.08225, 2021.
- A.Ā J. Taylor, V.Ā D. Dorobantu, H.Ā M. Le, Y.Ā Yue, and A.Ā D. Ames, āEpisodic learning with control lyapunov functions for uncertain robotic systems,ā in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 6878ā6884.
- A.Ā J. Taylor, A.Ā Singletary, Y.Ā Yue, and A.Ā Ames, āLearning for safety-critical control with control barrier functions,ā in Learning for Dynamics and Control, 2020, pp. 708ā717.
- T.Ā Westenbroek, F.Ā CastaƱeda, A.Ā Agrawal, S.Ā S. Sastry, and K.Ā Sreenath, āLearning min-norm stabilizing control laws for systems with unknown dynamics,ā in IEEE Conference on Decision and Control, 2020, pp. 737ā744.
- J.Ā Choi, F.Ā CastaƱeda, C.Ā Tomlin, and K.Ā Sreenath, āReinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions,ā in Robotics: Science and Systems, Corvalis, OR, 2020.
- F.Ā Berkenkamp, R.Ā Moriconi, A.Ā P. Schoellig, and A.Ā Krause, āSafe learning of regions of attraction for uncertain, nonlinear systems with gaussian processes,ā in IEEE Conference on Decision and Control, 2016, pp. 4661ā4666.
- F.Ā Berkenkamp, M.Ā Turchetta, A.Ā Schoellig, and A.Ā Krause, āSafe model-based reinforcement learning with stability guarantees,ā in Advances in Neural Information Processing Systems.Ā Ā Ā Curran Associates, Inc., 2017, vol.Ā 30, pp. 908ā918.
- J.Ā F. Fisac, A.Ā K. Akametalu, M.Ā N. Zeilinger, S.Ā Kaynama, J.Ā Gillula, and C.Ā J. Tomlin, āA general safety framework for learning-based control in uncertain robotic systems,ā IEEE Transactions on Automatic Control, vol.Ā 64, no.Ā 7, pp. 2737ā2752, 2018.
- J.Ā Umlauft, L.Ā Pƶhler, and S.Ā Hirche, āAn uncertainty-based control lyapunov approach for control-affine systems modeled by gaussian process,ā IEEE Control Systems Letters, vol.Ā 2, pp. 483ā488, 2018.
- D.Ā D. Fan, J.Ā Nguyen, R.Ā Thakker, N.Ā Alatur, A.Ā a.Ā Agha-mohammadi, and E.Ā A. Theodorou, āBayesian learning-based adaptive control for safety critical systems,ā in IEEE International Conference on Robotics and Automation, 2020, pp. 4093ā4099.
- R.Ā Cheng, M.Ā J. Khojasteh, A.Ā D. Ames, and J.Ā W. Burdick, āSafe multi-agent interaction through robust control barrier functions with learned uncertainties,ā in IEEE Conference on Decision and Control, 2020, pp. 777ā783.
- M.Ā H. Cohen and C.Ā Belta, āSafe exploration in model-based reinforcement learning using control barrier functions,ā arXiv preprint arXiv:2104.08171, 2021.
- F.Ā CastaƱeda, J.Ā J. Choi, B.Ā Zhang, C.Ā J. Tomlin, and K.Ā Sreenath, āGaussian process-based min-norm stabilizing controller for control-affine systems with uncertain input effects and dynamics,ā in American Control Conference, 2021.
- F.Ā CastaƱeda, J.Ā J. Choi, B.Ā Zhang, C.Ā J. Tomlin, and K.Ā Sreenath, āPointwise feasibility of gaussian process-based safety-critical control under model uncertainty,ā in IEEE Conference on Decision and Control, 2021, pp. 6762ā6769.
- A.Ā J. Taylor, V.Ā D. Dorobantu, S.Ā Dean, B.Ā Recht, Y.Ā Yue, and A.Ā D. Ames, āTowards robust data-driven control synthesis for nonlinear systems with actuation uncertainty,ā in IEEE Conference on Decision and Control, 2021, pp. 6469ā6476.
- M.Ā Greeff, A.Ā W. Hall, and A.Ā P. Schoellig, āLearning a stability filter for uncertain differentially flat systems using gaussian processes,ā in IEEE Conference on Decision and Control, 2021, pp. 789ā794.
- V.Ā Dhiman, M.Ā J. Khojasteh, M.Ā Franceschetti, and N.Ā Atanasov, āControl barriers in bayesian learning of system dynamics,ā IEEE Transactions on Automatic Control, 2021.
- L.Ā Brunke, S.Ā Zhou, and A.Ā P. Schoellig, āBarrier bayesian linear regression: Online learning of control barrier conditions for safety-critical control of uncertain systems,ā in Learning for Dynamics and Control, 2022, pp. 881ā892.
- J.Ā Umlauft and S.Ā Hirche, āFeedback linearization based on gaussian processes with event-triggered online learning,ā IEEE Transactions on Automatic Control, vol.Ā 65, no.Ā 10, pp. 4154ā4169, 2019.
- X.Ā Xu, P.Ā Tabuada, J.Ā W. Grizzle, and A.Ā D. Ames, āRobustness of control barrier functions for safety critical control,ā IFAC-PapersOnLine, vol.Ā 48, no.Ā 27, pp. 54ā61, 2015.
- C.Ā Dawson, Z.Ā Qin, S.Ā Gao, and C.Ā Fan, āSafe nonlinear control using robust neural lyapunov-barrier functions,ā in Conference on Robot Learning.Ā Ā Ā PMLR, 2022, pp. 1724ā1735.
- Z.Ā Qin, D.Ā Sun, and C.Ā Fan, āSablas: Learning safe control for black-box dynamical systems,ā IEEE Robotics and Automation Letters, vol.Ā 7, no.Ā 2, pp. 1928ā1935, 2022.
- P.Ā Jagtap, G.Ā J. Pappas, and M.Ā Zamani, āControl barrier functions for unknown nonlinear systems using gaussian processes,ā in 2020 59th IEEE Conference on Decision and Control (CDC).Ā Ā Ā IEEE, 2020, pp. 3699ā3704.
- L.Ā Lindemann, A.Ā Robey, L.Ā Jiang, S.Ā Tu, and N.Ā Matni, āLearning robust output control barrier functions from safe expert demonstrations,ā arXiv preprint arXiv:2111.09971, 2021.
- W.Ā Jin, Z.Ā Wang, Z.Ā Yang, and S.Ā Mou, āNeural certificates for safe control policies,ā arXiv preprint arXiv:2006.08465, 2020.
- D.Ā Duvenaud, āAutomatic model construction with gaussian processes,ā Ph.D. dissertation, University of Cambridge, 2014.
- N.Ā Srinivas, A.Ā Krause, S.Ā Kakade, and M.Ā Seeger, āGaussian process optimization in the bandit setting: No regret and experimental design,ā in International Conference on Machine Learning, 2010.
- J.Ā Umlauft, T.Ā Beckers, A.Ā Capone, A.Ā Lederer, and S.Ā Hirche, āSmart forgetting for safe online learning with gaussian processes,ā in Learning for Dynamics and Control, 2020, pp. 160ā169.
- H.Ā Liu, Y.-S. Ong, X.Ā Shen, and J.Ā Cai, āWhen gaussian process meets big data: A review of scalable gps,ā IEEE transactions on neural networks and learning systems, vol.Ā 31, no.Ā 11, pp. 4405ā4423, 2020.
- S.Ā Dean, A.Ā Taylor, R.Ā Cosner, B.Ā Recht, and A.Ā Ames, āGuaranteeing safety of learned perception modules via measurement-robust control barrier functions,ā in Conference on Robot Learning, 2021.
- J.Ā Buch, S.-C. Liao, and P.Ā Seiler, āRobust control barrier functions with sector-bounded uncertainties,ā IEEE Control Systems Letters, vol.Ā 6, pp. 1994ā1999, 2021.
- T.Ā Lew, A.Ā Sharma, J.Ā Harrison, E.Ā Schmerling, and M.Ā Pavone, āOn the problem of reformulating systems with uncertain dynamics as a stochastic differential equation,ā arXiv preprint arXiv:2111.06084, 2021.
- G.Ā Still, āLectures on parametric optimization: An introduction,ā Optimization Online, 2018.
- J.Ā Lygeros, K.Ā H. Johansson, S.Ā N. Simic, J.Ā Zhang, and S.Ā S. Sastry, āDynamical properties of hybrid automata,ā IEEE Transactions on Automatic Control, vol.Ā 48, no.Ā 1, pp. 2ā17, 2003.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.