
Optimal control of continuous-time symmetric systems with unknown dynamics and noisy measurements

Published 20 Mar 2024 in math.OC, cs.SY, and eess.SY | (2403.13605v1)

Abstract: An iterative learning algorithm is presented for continuous-time linear-quadratic optimal control problems where the system is externally symmetric with unknown dynamics. Both finite-horizon and infinite-horizon problems are considered. It is shown that the proposed algorithm is globally convergent to the optimal solution and has some advantages over adaptive dynamic programming, including being unbiased under noisy measurements and having a relatively low computational burden. Numerical experiments show the effectiveness of the results.
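For orientation, the "optimal solution" of an infinite-horizon linear-quadratic problem can be illustrated on a scalar system with known dynamics. The sketch below is not the algorithm proposed in the paper, which handles unknown dynamics. It is a classical model-based policy iteration, shown only to make the target of such iterative schemes concrete. The system x' = a*x + b*u, the cost weights q and r, and both function names are assumptions chosen for illustration.

```python
import math


def lqr_scalar_closed_form(a, b, q, r):
    """Stabilizing solution of the scalar Riccati equation
    2*a*p - (b*p)**2 / r + q = 0, with optimal feedback u = -k*x."""
    p = r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
    k = b * p / r
    return p, k


def policy_iteration_scalar(a, b, q, r, k0, iterations=20):
    """Model-based policy iteration for x' = a*x + b*u with
    cost = integral of (q*x^2 + r*u^2) dt. Hypothetical helper:
    it assumes a and b are known, unlike the setting in the abstract."""
    if a - b * k0 >= 0:
        raise ValueError("k0 must be stabilizing, i.e. a - b*k0 < 0")
    k = k0
    p = None
    for _ in range(iterations):
        # Policy evaluation: scalar Lyapunov equation
        # 2*(a - b*k)*p + q + r*k^2 = 0 solved for p.
        p = (q + r * k * k) / (2.0 * (b * k - a))
        # Policy improvement: update the feedback gain from p.
        k = b * p / r
    return p, k


if __name__ == "__main__":
    p_star, k_star = lqr_scalar_closed_form(1.0, 1.0, 3.0, 1.0)
    p_it, k_it = policy_iteration_scalar(1.0, 1.0, 3.0, 1.0, k0=2.0)
    print(p_star, k_star, p_it, k_it)
```

With a = b = r = 1 and q = 3, the Riccati equation reduces to p^2 - 2p - 3 = 0, whose positive root is p = 3, and the iteration started from the stabilizing gain k0 = 2 approaches the same value.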
