Primal-Dual Formulation in Mean Field Games
- Primal-dual formulation is a measure-theoretic framework that couples a primal optimization with a dual maximization, ensuring value matching via occupation measures.
- It converts continuous-time control and mean field games into linear programs by leveraging stochastic process constraints and Hamilton-Jacobi-Bellman (HJB) subsolutions.
- Strong duality under classical regularity regimes enables precise characterization of Nash equilibria even in nonconvex and nonunique solution settings.
A primal-dual formulation is a measure-theoretic or variational framework that simultaneously expresses an optimization problem (the "primal") and a concave or maximization problem (the "dual"), with value-matching and feasibility conditions linking their solutions. In continuous-time control and game-theoretic contexts, primal-dual formulations enable both the characterization of optimal controls/policies and the precise analytical description of all equilibria (e.g., Nash equilibria in continuous-time mean field games). The technical core is often an equivalence between the original closed-loop control problem and a linear program over occupation measures, together with an abstract dual based on subsolutions to a corresponding Hamilton-Jacobi-Bellman (HJB) equation, resulting in strong duality under minimal regularity assumptions (Guo et al., 2 Mar 2025).
1. Measure-Theoretic Primal Formulation for Control and MFGs
Consider a continuous-time controlled diffusion process for a representative agent with state space , action space (Polish metric), and finite time horizon . Given a deterministic mean-field flow and initial measure , the agent selects a measurable, relaxed Markov policy . The state dynamics follow
for appropriate drift/diffusion and -dimensional Brownian motion .
The occupation measure approach replaces stochastic process optimization with a linear program on , with representing the law of and encoding time–state–action occupancy. The key linear "martingale-constraint" for smooth test functions is: where denotes the controlled generator.
The corresponding primal LP is: for bounded measurable running/terminal costs .
An equivalence theorem (Thm 3.5) establishes that the value of the occupation-measure LP matches the original closed-loop stochastic control value: with a precise correspondence between optimal policies and optimal occupation measures via disintegration and superposition principles.
2. Dual Formulation: HJB Subsolutions and Abstract Duality
Duality is achieved by constructing the adjoint operator acting on smooth test functions , leading to dual feasibility conditions:
- Terminal cost upper bound:
- Pointwise subsolution constraint:
This yields the dual maximization problem over smooth subsolutions of the formal HJB equation: where
The formal HJB equation for the value function takes the shape: Weak duality () is immediate by pairing primal-feasible and dual-feasible elements.
3. Strong Duality and Regularity Regimes
Strong duality () holds when the HJB admits a classical () solution . Under such circumstances, there exists a measurable selector implementing the minimizer in the HJB, and the deterministic feedback policy produces a weak solution to the SDE, saturating the dual constraints.
Elliptic/parabolic regularity arguments differentiate two regimes:
- Semilinear HJB (uncontrolled ) admit classical solutions via Schauder estimates.
- Fully nonlinear HJB (controlled ) with Lipschitz data yield classical solutions via Evans–Krylov theory.
In both, Itô’s formula verifies equality: The proof structure involves measurable selection, construction of occupation measures, and demonstration of vanishing duality gap.
4. Primal-Dual Characterization of Nash Equilibria
A Nash equilibrium (NE) for MFGs is a triple such that, given flow , solves the representative control problem and is consistent, .
The primal-dual system characterizing NE (Thm 4.12) is:
- Primal feasibility:
- Dual feasibility:
- Value matching:
- Consistency: time–state marginal of is
When strong duality holds at , any Nash equilibrium must satisfy this system. Every minimizer or subsolution (whether pure or mixed) is faithfully represented, and joint primal-dual feasibility identifies all NEs.
5. Absence of Convexity and Uniqueness Assumptions
Unlike conventional approaches requiring convexity in the Hamiltonian or uniqueness of the optimizer, this primal-dual framework only presupposes measurability and boundedness of the coefficients . It allows for nonconvex, nonunique minimizers, and remains robust even when the HJB equation lacks classical (or continuous) solutions. Subsolution-based dual feasibility permits NE construction on the support of the actual flow, not just globally.
6. Key Technical Ingredients and Proof Structure
Crucial lemmas include:
- Disintegration: every occupation measure can be uniquely decomposed into a Markov kernel and a time-marginal .
- Superposition principle: continuous solutions to the time-marginal Fokker–Planck equation lift to martingale solutions of the SDE.
- LP Duality theory: abstract weak duality holds whenever primal and dual cones are nonempty.
- PDE estimates (Schauder, Evans–Krylov) guarantee existence of classical HJB solutions in the respective regimes.
These technical ingredients underwrite the full identification of Nash equilibria through matched primal-dual feasibility and value matching, closing both the analytical and measure-theoretic duality gap (Guo et al., 2 Mar 2025).
Table: Analytical Structures in Primal-Dual MFG Formulation
| Component | Mathematical Object | Function/Property |
|---|---|---|
| Primal LP | Occupation measures | Minimizes expected cost given flow |
| Dual Problem | Smooth subsolutions | Maximizes initial value subject to HJB-type inequalities |
| Constraint | Martingale-constraint Eqn | Enforces valid controlled diffusion paths |
| NE Characterization | Triple | Feasibility, value matching, law-consistency |
| Regularity regime | Classical HJB solution | Ensures strong duality, explicit feedback |
| Extension | Nonconvex/Nonunique regime | Subsolution-based feasibility, generalized NE |
This primal-dual analytical framework has established itself as a rigorous and complete characterization tool in continuous-time MFGs, general stochastic control, and beyond, providing full identification of Nash equilibria without restrictive convexity or uniqueness conditions.