Papers
Topics
Authors
Recent
Search
2000 character limit reached

Primal-Dual Formulation in Mean Field Games

Updated 31 December 2025
  • Primal-dual formulation is a measure-theoretic framework that couples a primal optimization with a dual maximization, ensuring value matching via occupation measures.
  • It converts continuous-time control and mean field games into linear programs by leveraging stochastic process constraints and Hamilton-Jacobi-Bellman (HJB) subsolutions.
  • Strong duality under classical regularity regimes enables precise characterization of Nash equilibria even in nonconvex and nonunique solution settings.

A primal-dual formulation is a measure-theoretic or variational framework that simultaneously expresses an optimization problem (the "primal") and a concave or maximization problem (the "dual"), with value-matching and feasibility conditions linking their solutions. In continuous-time control and game-theoretic contexts, primal-dual formulations enable both the characterization of optimal controls/policies and the precise analytical description of all equilibria (e.g., Nash equilibria in continuous-time mean field games). The technical core is often an equivalence between the original closed-loop control problem and a linear program over occupation measures, together with an abstract dual based on subsolutions to a corresponding Hamilton-Jacobi-Bellman (HJB) equation, resulting in strong duality under minimal regularity assumptions (Guo et al., 2 Mar 2025).

1. Measure-Theoretic Primal Formulation for Control and MFGs

Consider a continuous-time controlled diffusion process for a representative agent with state space Rd\R^d, action space AA (Polish metric), and finite time horizon [0,T][0,T]. Given a deterministic mean-field flow μ=(μt)t[0,T]P(Rd)[0,T]\bm\mu = (\mu_t)_{t\in[0,T]} \in P(\R^d)^{[0,T]} and initial measure ρ\rho, the agent selects a measurable, relaxed Markov policy γ:[0,T]×RdP(A)\gamma: [0,T]\times\R^d \to P(A). The state dynamics follow

Xt=X0+0tbμ,γ(s,Xs)ds+0tσμ,γ(s,Xs)dWsX_t = X_0 + \int_0^t b^{\bm\mu,\gamma}(s,X_s)ds + \int_0^t \sigma^{\bm\mu,\gamma}(s,X_s) dW_s

for appropriate drift/diffusion b,σb,\sigma and dd-dimensional Brownian motion WW.

The occupation measure approach replaces stochastic process optimization with a linear program on X+=M+(Rd)×M+([0,T]×Rd×A)X_+ = M_+(\R^d) \times M_+([0,T]\times\R^d\times A), with ν\nu representing the law of XTX_T and ξ\xi encoding time–state–action occupancy. The key linear "martingale-constraint" for smooth test functions ψW=Cb1,2\psi \in W = C_b^{1,2} is: Rdψ(T,x)ν(dx)Rdψ(0,x)ρ(dx)=[0,T]×Rd×A[tψ+μψ]ξ(dt,dx,da)\int_{\R^d} \psi(T,x) \nu(dx) - \int_{\R^d} \psi(0,x) \rho(dx) = \int_{[0,T]\times\R^d\times A} [\partial_t\psi + {}^{\bm\mu}\psi] \, \xi(dt,dx,da) where μψ{}^{\bm\mu}\psi denotes the controlled generator.

The corresponding primal LP is: VPμ=inf(ν,ξ)DP(μ){f(t,x,a,μt)ξ(dt,dx,da)+g(x,μT)ν(dx)}V_P^{\bm\mu} = \inf_{(\nu,\xi)\in\mathcal{D}_P(\bm\mu)} \left\{ \int f(t,x,a,\mu_t) \xi(dt,dx,da) + \int g(x,\mu_T) \nu(dx) \right\} for bounded measurable running/terminal costs f,gf,g.

An equivalence theorem (Thm 3.5) establishes that the value of the occupation-measure LP matches the original closed-loop stochastic control value: Vclμ=VPμV_{\mathrm{cl}}^{\bm\mu} = V_P^{\bm\mu} with a precise correspondence between optimal policies and optimal occupation measures via disintegration and superposition principles.

2. Dual Formulation: HJB Subsolutions and Abstract Duality

Duality is achieved by constructing the adjoint operator LL^* acting on smooth test functions ψ\psi, leading to dual feasibility conditions:

  • Terminal cost upper bound: g(x,μT)ψ(T,x)g(x,\mu_T) \ge \psi(T,x)
  • Pointwise subsolution constraint: tψ(t,x)+μψ(t,x,a)+f(t,x,a,μt)0\partial_t\psi(t,x) + {}^{\bm\mu}\psi(t,x,a) + f(t,x,a,\mu_t) \ge 0

This yields the dual maximization problem over smooth subsolutions of the formal HJB equation: VDμ=supψDP(μ)Rdψ(0,x)ρ(dx)V_D^{\bm\mu} = \sup_{\psi \in D_{P^*}(\bm\mu)} \int_{\R^d} \psi(0,x) \rho(dx) where

DP(μ)={ψCb1,2([0,T]×Rd):ψ(T,)g(,μT),  tψ+μψ+f0}D_{P^*}(\bm\mu) = \{\psi \in C_b^{1,2}([0,T]\times\R^d) : \psi(T,\cdot) \le g(\cdot,\mu_T), \; \partial_t\psi + {}^{\bm\mu}\psi + f \ge 0 \}

The formal HJB equation for the value function VV takes the shape: tV+infaA{μV(t,x,a)+f(t,x,a,μt)}=0,V(T,x)=g(x,μT)\partial_t V + \inf_{a \in A} \left\{ {}^{\bm\mu}V(t,x,a) + f(t,x,a,\mu_t) \right\} = 0, \quad V(T,x)=g(x,\mu_T) Weak duality (VPμVDμV_P^{\bm\mu} \ge V_D^{\bm\mu}) is immediate by pairing primal-feasible and dual-feasible elements.

3. Strong Duality and Regularity Regimes

Strong duality (VPμ=VDμV_P^{\bm\mu} = V_D^{\bm\mu}) holds when the HJB admits a classical (Cb1,2C_b^{1,2}) solution VV. Under such circumstances, there exists a measurable selector ϕ\phi implementing the minimizer in the HJB, and the deterministic feedback policy γ(dat,x)=δϕ(t,x)(da)\gamma(da|t,x)=\delta_{\phi(t,x)}(da) produces a weak solution to the SDE, saturating the dual constraints.

Elliptic/parabolic regularity arguments differentiate two regimes:

  • Semilinear HJB (uncontrolled σ\sigma) admit classical solutions via Schauder estimates.
  • Fully nonlinear HJB (controlled σ\sigma) with Lipschitz data yield classical solutions via Evans–Krylov theory.

In both, Itô’s formula verifies equality: VPμ=VDμ=V(0,x)ρ(dx)V_P^{\bm\mu} = V_D^{\bm\mu} = \int V(0,x) \rho(dx) The proof structure involves measurable selection, construction of occupation measures, and demonstration of vanishing duality gap.

4. Primal-Dual Characterization of Nash Equilibria

A Nash equilibrium (NE) for MFGs is a triple (μ,X,γ)(\bm\mu^*, X^*, \gamma^*) such that, given flow μ\bm\mu^*, (X,γ)(X^*, \gamma^*) solves the representative control problem and is consistent, μt=Law(Xt)\mu_t^* = \operatorname{Law}(X_t^*).

The primal-dual system characterizing NE (Thm 4.12) is:

  • Primal feasibility: (ξ,μT)DP(μ)(\xi^*, \mu_T^*) \in \mathcal{D}_P(\bm\mu^*)
  • Dual feasibility: ψDP(μ)\psi^* \in D_{P^*}(\bm\mu^*)
  • Value matching: g(x,μT)μT(dx)+fdξ=ψ(0,x)ρ(dx)\int g(x, \mu_T^*) \mu_T^*(dx) + \int f\,d\xi^* = \int \psi^*(0,x)\rho(dx)
  • Consistency: time–state marginal of ξ\xi^* is μtdt\mu_t^*\,dt

When strong duality holds at μ\bm\mu^*, any Nash equilibrium must satisfy this system. Every minimizer or subsolution (whether pure or mixed) is faithfully represented, and joint primal-dual feasibility identifies all NEs.

5. Absence of Convexity and Uniqueness Assumptions

Unlike conventional approaches requiring convexity in the Hamiltonian or uniqueness of the optimizer, this primal-dual framework only presupposes measurability and boundedness of the coefficients (b,σ,f,g)(b, \sigma, f, g). It allows for nonconvex, nonunique minimizers, and remains robust even when the HJB equation lacks classical (or continuous) solutions. Subsolution-based dual feasibility permits NE construction on the support of the actual flow, not just globally.

6. Key Technical Ingredients and Proof Structure

Crucial lemmas include:

  • Disintegration: every occupation measure ξ\xi can be uniquely decomposed into a Markov kernel γ(dat,x)\gamma(da|t,x) and a time-marginal mtX(dx)m^X_t(dx).
  • Superposition principle: continuous solutions to the time-marginal Fokker–Planck equation lift to martingale solutions of the SDE.
  • LP Duality theory: abstract weak duality holds whenever primal and dual cones are nonempty.
  • PDE estimates (Schauder, Evans–Krylov) guarantee existence of classical HJB solutions in the respective regimes.

These technical ingredients underwrite the full identification of Nash equilibria through matched primal-dual feasibility and value matching, closing both the analytical and measure-theoretic duality gap (Guo et al., 2 Mar 2025).

Table: Analytical Structures in Primal-Dual MFG Formulation

Component Mathematical Object Function/Property
Primal LP Occupation measures (ν,ξ)(\nu,\xi) Minimizes expected cost given flow
Dual Problem Smooth subsolutions ψ\psi Maximizes initial value subject to HJB-type inequalities
Constraint Martingale-constraint Eqn Enforces valid controlled diffusion paths
NE Characterization Triple (μ,ξ,ψ)(\bm\mu^*, \xi^*, \psi^*) Feasibility, value matching, law-consistency
Regularity regime Classical HJB solution Ensures strong duality, explicit feedback
Extension Nonconvex/Nonunique regime Subsolution-based feasibility, generalized NE

This primal-dual analytical framework has established itself as a rigorous and complete characterization tool in continuous-time MFGs, general stochastic control, and beyond, providing full identification of Nash equilibria without restrictive convexity or uniqueness conditions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Primal-Dual Formulation.