Papers
Topics
Authors
Recent
Search
2000 character limit reached

Recursively Feasible Probabilistic Safe Online Learning with Control Barrier Functions

Published 23 Aug 2022 in eess.SY, cs.LG, cs.SY, and math.OC | (2208.10733v3)

Abstract: Learning-based control has recently shown great efficacy in performing complex tasks for various applications. However, to deploy it in real systems, it is of vital importance to guarantee the system will stay safe. Control Barrier Functions (CBFs) offer mathematical tools for designing safety-preserving controllers for systems with known dynamics. In this article, we first introduce a model-uncertainty-aware reformulation of CBF-based safety-critical controllers using Gaussian Process (GP) regression to close the gap between an approximate mathematical model and the real system, which results in a second-order cone program (SOCP)-based control design. We then present the pointwise feasibility conditions of the resulting safety controller, highlighting the level of richness that the available system information must meet to ensure safety. We use these conditions to devise an event-triggered online data collection strategy that ensures the recursive feasibility of the learned safety controller. Our method works by constantly reasoning about whether the current information is sufficient to ensure safety or if new measurements under active safe exploration are required to reduce the uncertainty. As a result, our proposed framework can guarantee the forward invariance of the safe set defined by the CBF with high probability, even if it contains a priori unexplored regions. We validate the proposed framework in two numerical simulation experiments.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (45)
  1. A.Ā D. Ames, X.Ā Xu, J.Ā W. Grizzle, and P.Ā Tabuada, ā€œControl barrier function based quadratic programs for safety critical systems,ā€ IEEE Transactions on Automatic Control, vol.Ā 62, pp. 3861–3876, 2017.
  2. S.Ā Bansal, M.Ā Chen, S.Ā Herbert, and C.Ā J. Tomlin, ā€œHamilton-jacobi reachability: A brief overview and recent advances,ā€ in 2017 56th IEEE Conference on Decision and Control (CDC), 2017.
  3. K.Ā P. Wabersich and M.Ā N. Zeilinger, ā€œA predictive safety filter for learning-based control of constrained nonlinear dynamical systems,ā€ Automatica, vol. 129, p. 109597, 2021.
  4. Q.Ā Nguyen and K.Ā Sreenath, ā€œL1 adaptive control for bipedal robots with control lyapunov function based quadratic programs,ā€ in American Control Conference, Chicago, IL, July 2015, pp. 862–867.
  5. A.Ā J. Taylor and A.Ā D. Ames, ā€œAdaptive safety with control barrier functions,ā€ in American Control Conference, 2020, pp. 1399–1405.
  6. B.Ā T. Lopez, J.-J.Ā E. Slotine, and J.Ā P. How, ā€œRobust adaptive control barrier functions: An adaptive and data-driven approach to safety,ā€ IEEE Control Systems Letters, vol.Ā 5, no.Ā 3, pp. 1031–1036, 2020.
  7. M.Ā Black and D.Ā Panagou, ā€œAdaptation for validation of a consolidated control barrier function based control synthesis,ā€ arXiv preprint arXiv:2209.08170, 2022.
  8. Q.Ā Nguyen and K.Ā Sreenath, ā€œRobust safety-critical control for dynamic robotics,ā€ IEEE Transactions on Automatic Control, 2021.
  9. J.Ā J. Choi, D.Ā Lee, K.Ā Sreenath, C.Ā J. Tomlin, and S.Ā L. Herbert, ā€œRobust control barrier–value functions for safety-critical control,ā€ in 2021 60th IEEE Conference on Decision and Control (CDC).Ā Ā Ā IEEE, 2021, pp. 6814–6821.
  10. S.Ā Kolathaya and A.Ā D. Ames, ā€œInput-to-state safety with control barrier functions,ā€ IEEE control systems letters, vol.Ā 3, no.Ā 1, pp. 108–113, 2018.
  11. A.Ā Alan, A.Ā J. Taylor, C.Ā R. He, A.Ā D. Ames, and G.Ā Orosz, ā€œControl barrier functions and input-to-state safety with application to automated vehicles,ā€ IEEE Transactions on Control Systems Technology, 2023.
  12. M.Ā Krstic, ā€œInverse optimal safety filters,ā€ arXiv preprint arXiv:2112.08225, 2021.
  13. A.Ā J. Taylor, V.Ā D. Dorobantu, H.Ā M. Le, Y.Ā Yue, and A.Ā D. Ames, ā€œEpisodic learning with control lyapunov functions for uncertain robotic systems,ā€ in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 6878–6884.
  14. A.Ā J. Taylor, A.Ā Singletary, Y.Ā Yue, and A.Ā Ames, ā€œLearning for safety-critical control with control barrier functions,ā€ in Learning for Dynamics and Control, 2020, pp. 708–717.
  15. T.Ā Westenbroek, F.Ā CastaƱeda, A.Ā Agrawal, S.Ā S. Sastry, and K.Ā Sreenath, ā€œLearning min-norm stabilizing control laws for systems with unknown dynamics,ā€ in IEEE Conference on Decision and Control, 2020, pp. 737–744.
  16. J.Ā Choi, F.Ā CastaƱeda, C.Ā Tomlin, and K.Ā Sreenath, ā€œReinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions,ā€ in Robotics: Science and Systems, Corvalis, OR, 2020.
  17. F.Ā Berkenkamp, R.Ā Moriconi, A.Ā P. Schoellig, and A.Ā Krause, ā€œSafe learning of regions of attraction for uncertain, nonlinear systems with gaussian processes,ā€ in IEEE Conference on Decision and Control, 2016, pp. 4661–4666.
  18. F.Ā Berkenkamp, M.Ā Turchetta, A.Ā Schoellig, and A.Ā Krause, ā€œSafe model-based reinforcement learning with stability guarantees,ā€ in Advances in Neural Information Processing Systems.Ā Ā Ā Curran Associates, Inc., 2017, vol.Ā 30, pp. 908–918.
  19. J.Ā F. Fisac, A.Ā K. Akametalu, M.Ā N. Zeilinger, S.Ā Kaynama, J.Ā Gillula, and C.Ā J. Tomlin, ā€œA general safety framework for learning-based control in uncertain robotic systems,ā€ IEEE Transactions on Automatic Control, vol.Ā 64, no.Ā 7, pp. 2737–2752, 2018.
  20. J.Ā Umlauft, L.Ā Pƶhler, and S.Ā Hirche, ā€œAn uncertainty-based control lyapunov approach for control-affine systems modeled by gaussian process,ā€ IEEE Control Systems Letters, vol.Ā 2, pp. 483–488, 2018.
  21. D.Ā D. Fan, J.Ā Nguyen, R.Ā Thakker, N.Ā Alatur, A.Ā a.Ā Agha-mohammadi, and E.Ā A. Theodorou, ā€œBayesian learning-based adaptive control for safety critical systems,ā€ in IEEE International Conference on Robotics and Automation, 2020, pp. 4093–4099.
  22. R.Ā Cheng, M.Ā J. Khojasteh, A.Ā D. Ames, and J.Ā W. Burdick, ā€œSafe multi-agent interaction through robust control barrier functions with learned uncertainties,ā€ in IEEE Conference on Decision and Control, 2020, pp. 777–783.
  23. M.Ā H. Cohen and C.Ā Belta, ā€œSafe exploration in model-based reinforcement learning using control barrier functions,ā€ arXiv preprint arXiv:2104.08171, 2021.
  24. F.Ā CastaƱeda, J.Ā J. Choi, B.Ā Zhang, C.Ā J. Tomlin, and K.Ā Sreenath, ā€œGaussian process-based min-norm stabilizing controller for control-affine systems with uncertain input effects and dynamics,ā€ in American Control Conference, 2021.
  25. F.Ā CastaƱeda, J.Ā J. Choi, B.Ā Zhang, C.Ā J. Tomlin, and K.Ā Sreenath, ā€œPointwise feasibility of gaussian process-based safety-critical control under model uncertainty,ā€ in IEEE Conference on Decision and Control, 2021, pp. 6762–6769.
  26. A.Ā J. Taylor, V.Ā D. Dorobantu, S.Ā Dean, B.Ā Recht, Y.Ā Yue, and A.Ā D. Ames, ā€œTowards robust data-driven control synthesis for nonlinear systems with actuation uncertainty,ā€ in IEEE Conference on Decision and Control, 2021, pp. 6469–6476.
  27. M.Ā Greeff, A.Ā W. Hall, and A.Ā P. Schoellig, ā€œLearning a stability filter for uncertain differentially flat systems using gaussian processes,ā€ in IEEE Conference on Decision and Control, 2021, pp. 789–794.
  28. V.Ā Dhiman, M.Ā J. Khojasteh, M.Ā Franceschetti, and N.Ā Atanasov, ā€œControl barriers in bayesian learning of system dynamics,ā€ IEEE Transactions on Automatic Control, 2021.
  29. L.Ā Brunke, S.Ā Zhou, and A.Ā P. Schoellig, ā€œBarrier bayesian linear regression: Online learning of control barrier conditions for safety-critical control of uncertain systems,ā€ in Learning for Dynamics and Control, 2022, pp. 881–892.
  30. J.Ā Umlauft and S.Ā Hirche, ā€œFeedback linearization based on gaussian processes with event-triggered online learning,ā€ IEEE Transactions on Automatic Control, vol.Ā 65, no.Ā 10, pp. 4154–4169, 2019.
  31. X.Ā Xu, P.Ā Tabuada, J.Ā W. Grizzle, and A.Ā D. Ames, ā€œRobustness of control barrier functions for safety critical control,ā€ IFAC-PapersOnLine, vol.Ā 48, no.Ā 27, pp. 54–61, 2015.
  32. C.Ā Dawson, Z.Ā Qin, S.Ā Gao, and C.Ā Fan, ā€œSafe nonlinear control using robust neural lyapunov-barrier functions,ā€ in Conference on Robot Learning.Ā Ā Ā PMLR, 2022, pp. 1724–1735.
  33. Z.Ā Qin, D.Ā Sun, and C.Ā Fan, ā€œSablas: Learning safe control for black-box dynamical systems,ā€ IEEE Robotics and Automation Letters, vol.Ā 7, no.Ā 2, pp. 1928–1935, 2022.
  34. P.Ā Jagtap, G.Ā J. Pappas, and M.Ā Zamani, ā€œControl barrier functions for unknown nonlinear systems using gaussian processes,ā€ in 2020 59th IEEE Conference on Decision and Control (CDC).Ā Ā Ā IEEE, 2020, pp. 3699–3704.
  35. L.Ā Lindemann, A.Ā Robey, L.Ā Jiang, S.Ā Tu, and N.Ā Matni, ā€œLearning robust output control barrier functions from safe expert demonstrations,ā€ arXiv preprint arXiv:2111.09971, 2021.
  36. W.Ā Jin, Z.Ā Wang, Z.Ā Yang, and S.Ā Mou, ā€œNeural certificates for safe control policies,ā€ arXiv preprint arXiv:2006.08465, 2020.
  37. D.Ā Duvenaud, ā€œAutomatic model construction with gaussian processes,ā€ Ph.D. dissertation, University of Cambridge, 2014.
  38. N.Ā Srinivas, A.Ā Krause, S.Ā Kakade, and M.Ā Seeger, ā€œGaussian process optimization in the bandit setting: No regret and experimental design,ā€ in International Conference on Machine Learning, 2010.
  39. J.Ā Umlauft, T.Ā Beckers, A.Ā Capone, A.Ā Lederer, and S.Ā Hirche, ā€œSmart forgetting for safe online learning with gaussian processes,ā€ in Learning for Dynamics and Control, 2020, pp. 160–169.
  40. H.Ā Liu, Y.-S. Ong, X.Ā Shen, and J.Ā Cai, ā€œWhen gaussian process meets big data: A review of scalable gps,ā€ IEEE transactions on neural networks and learning systems, vol.Ā 31, no.Ā 11, pp. 4405–4423, 2020.
  41. S.Ā Dean, A.Ā Taylor, R.Ā Cosner, B.Ā Recht, and A.Ā Ames, ā€œGuaranteeing safety of learned perception modules via measurement-robust control barrier functions,ā€ in Conference on Robot Learning, 2021.
  42. J.Ā Buch, S.-C. Liao, and P.Ā Seiler, ā€œRobust control barrier functions with sector-bounded uncertainties,ā€ IEEE Control Systems Letters, vol.Ā 6, pp. 1994–1999, 2021.
  43. T.Ā Lew, A.Ā Sharma, J.Ā Harrison, E.Ā Schmerling, and M.Ā Pavone, ā€œOn the problem of reformulating systems with uncertain dynamics as a stochastic differential equation,ā€ arXiv preprint arXiv:2111.06084, 2021.
  44. G.Ā Still, ā€œLectures on parametric optimization: An introduction,ā€ Optimization Online, 2018.
  45. J.Ā Lygeros, K.Ā H. Johansson, S.Ā N. Simic, J.Ā Zhang, and S.Ā S. Sastry, ā€œDynamical properties of hybrid automata,ā€ IEEE Transactions on Automatic Control, vol.Ā 48, no.Ā 1, pp. 2–17, 2003.
Citations (5)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.