An Optimal Control Theory for Accelerated Optimization

Published 24 Feb 2019 in math.OC | (1902.09004v3)

Abstract: The first-order optimality conditions for a generic nonlinear optimization problem are generated as part of the terminal transversality conditions of an optimal control problem. It is shown that the Lagrangian of the optimization problem is connected to the Hamiltonian of the optimal control problem via a zero-Hamiltonian, infinite-order, singular arc. The necessary conditions for the singular optimal control problem are used to produce an auxiliary controllable dynamical system whose trajectories generate algorithm primitives for the optimization problem. A three-step iterative map for a generic algorithm is designed by a semi-discretization step. Neither the feedback control law nor the differential equation governing the algorithm need be derived explicitly. A search direction is produced by a proximal-aiming-type method that dissipates a control Lyapunov function. New step size procedures based on minimizing control Lyapunov functions along a search vector complete the design of the accelerated algorithms.

Abstract PDF Upgrade to Chat

Citations (5)

View on Semantic Scholar

Summary

The paper introduces an innovative framework that links optimal control theory with nonlinear optimization to derive accelerated algorithms.
It employs Control Lyapunov Functions and proximal aiming to guide trajectories while maintaining a zero-Hamiltonian condition.
Numerical results demonstrate improved convergence speed over traditional methods, highlighting the practical benefits of the approach.

An Optimal Control Theory for Accelerated Optimization

This research explores the utilization of optimal control theory to develop accelerated optimization algorithms. By connecting the principles of optimal control with nonlinear optimization, the paper introduces a novel framework for deriving optimization algorithms with potential extensions into accelerated forms.

Optimal Control Framework for Optimization

The paper presents a framework wherein the first-order optimality conditions for a nonlinear optimization problem are interpreted as terminal transversality conditions of an optimal control problem. The authors establish a link between the optimization problem's Lagrangian and the Hamiltonian of an optimal control problem, which is characterized by a zero-Hamiltonian, infinite-order singular arc. This approach provides a foundation for constructing dynamic systems whose trajectories serve as algorithm primitives for solving optimization problems.

Key Components:

Control Lyapunov Function (CLF): Used to guide the trajectories of the dynamical system into regions that contribute to solving the optimization problem.
Proximal Aiming: A method to adjust control parameters to maintain trajectory adherence to a manifold representing the zero-Hamiltonian condition.
Iterative Mapping: A semi-discretization process yielding a three-step iterative map that does not explicitly require the derivation of a control law or differential equation, thereby facilitating the development of new algorithms.

Dynamics and Singular Control

Two critical differential models underlie the theory:

Velocity Model: Related to traditional optimization via Euler discretization.
Acceleration Model: Suggested as a mechanism to generate accelerated algorithms by employing a double integrator framework, integrating past information to enhance trajectory progression.

Singularities:

The study identifies that the dynamical system must navigate an infinite-order singular arc, highlighting the absence of a general optimal solution and necessitating the development of tailored control approaches to guide the system effectively.

Algorithm Design and Optimal Control

A salient feature of this framework is leveraging the transversality mapping principle (TMP), which connects first-order optimality conditions to terminal conditions in control problems, providing an innovative path to algorithm design. This connection allows the systematic derivation of algorithms that can potentially accelerate convergence, particularly useful in machine learning and large-scale optimization.

Control Laws:

The development of proximal aiming-based control laws is essential, guiding the trajectory without the need for explicit feedback control derivation, thus diverging from conventional numerical methods that focus on discretizing differential equations accurately.

Numerical Methodology and Step Size Adjustment

The exploration extends to a discussion on optimal step size selection, crucial for maintaining algorithm efficiency and stability:

Exact Step Lengths: Proposes methods for calculating exact step lengths to enhance convergence rates.
Approximate Conditions: Employs approximations to derive feasible solutions for step sizes in iterative algorithms.

Practical Application and Numerical Results

A practical illustration demonstrates the application of the proposed framework against a quadratic function minimization task, showcasing the acceleration benefits rendered by the novel methods. The results indicate substantial improvements in convergence speed compared to traditional gradient methods, emphasizing the potential of this approach in real-world optimization challenges.

Conclusion

The study culminates in a verification of the proposed theoretical constructs through numerical examples, asserting that the adaptation of optimal control theory to optimization allows for the construction of both accelerated and unaccelerated algorithms. The paper establishes a new paradigm in designing optimization algorithms, emphasizing proximal aiming and control theory concepts, paving the way for further exploration into complex systems and large-scale data-driven applications.

Markdown Report Issue