Platoon-Based Variance Minimization (PVM)

Updated 23 November 2025

Platoon-based Variance Minimization (PVM) is a control strategy that minimizes delay variance in platoon systems through combinatorial scheduling and stochastic feedback control.
It integrates reservation-based intersection management with mean–variance optimal control in switching diffusion environments to ensure safe, fair, and efficient vehicle flow.
Reported simulations show that PVM lowers average delay and delay variance, raises throughput, and reduces fuel consumption relative to a conventional four-phase traffic signal.

Platoon-based Variance Minimization (PVM) refers to a class of strategies and control methodologies for managing collections (“platoons”) of vehicles or subsystems so as to minimize the variance in experienced delay, control performance, or flow, subject to system constraints. The PVM problem appears in two dominant incarnations: reservation-based intersection scheduling for autonomous vehicle platoons (Bashiri et al., 2018) and mean–variance optimal control for stochastic flow systems with switching diffusion environments (Yang et al., 2014). These approaches combine fairness/minimum-variance scheduling criteria, combinatorial or feedback control algorithms, and system-theoretic safety or stability guarantees, with demonstrated effectiveness in large-scale intelligent transportation settings.

1. Mathematical Formulation and Scheduling Criterion

For autonomous intersection management, the primary objective is to compute a reservation schedule $s$ for $N$ platoons that trades off total vehicle delay and fairness, quantified as variance in delay among platoons. The core cost is the schedule-dependent functional

$\delta_{1}(s) = \sum_{j=2}^{N} j \left( d(p_j) + \sum_{i=1}^{j-1} t_c(p_i) \right)$

where $d(p_j)$ is the delay for platoon $p_j$, and $t_c(p_i)$ is the clearance time imposed by platoon $p_i$ on later platoons. The optimal schedule $s^*$ solves $\min_{s \in S} \delta_{1}(s)$ over the set $S$ of all feasible, conflict-free platoon orderings and groupings.
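As a minimal sketch, the functional $\delta_1$ can be evaluated for one fixed ordering as follows. The function name `delta_1` and the list-based inputs are illustrative assumptions, not part of the cited work; delays and clearance times are treated as given numbers in schedule order, whereas a real scheduler would derive them from vehicle kinematics.

```python
def delta_1(delays, clearance_times):
    """Evaluate delta_1 for platoons listed in schedule order.

    delays[j-1] is d(p_j) and clearance_times[j-1] is t_c(p_j), both in
    seconds. These inputs are hypothetical stand-ins for values a real
    intersection manager would compute.
    """
    total = 0.0
    blocked = 0.0  # running sum of t_c(p_i) for i < j
    for j in range(1, len(delays) + 1):
        if j >= 2:  # the sum in the formula starts at j = 2
            total += j * (delays[j - 1] + blocked)
        blocked += clearance_times[j - 1]
    return total


# Two orderings of the same three platoons (made-up numbers).
cost_a = delta_1([0.0, 0.0, 0.0], [1.0, 2.0, 3.0])
cost_b = delta_1([0.0, 0.0, 0.0], [3.0, 2.0, 1.0])
print(cost_a, cost_b)
```

With these inputs the ordering that places short clearance times first has the lower cost, because later positions carry larger weights $j$.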

In the stochastic control framework, the corresponding objective is a mean–variance criterion for the final state $X^{\epsilon}(T)$ of the platoon (e.g., total flow, separation distance), where system evolution is subject to both continuous stochasticity and Markovian regime switching:

$J^{\epsilon}(u, \lambda) = \mathbb{E}|X^{\epsilon}(T) - z + \lambda|^2 - \lambda^2$

where $z$ is the target mean, $\lambda$ is the multiplier attached to the mean constraint, and the control $u(\cdot)$ is designed to minimize terminal variance while meeting that constraint (Yang et al., 2014).
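To see why this objective targets the variance, the square can be expanded. The following short derivation assumes a scalar terminal state and a mean constraint of the form $\mathbb{E}X^{\epsilon}(T) = z$:

$\mathbb{E}|X^{\epsilon}(T) - z + \lambda|^2 - \lambda^2 = \mathbb{E}|X^{\epsilon}(T) - z|^2 + 2\lambda\,\mathbb{E}[X^{\epsilon}(T) - z]$

When the mean constraint holds, the second term vanishes and the first term equals $\mathrm{Var}(X^{\epsilon}(T))$, so $J^{\epsilon}$ reduces to the terminal variance. In this reading $\lambda$ acts as a Lagrange multiplier for the mean constraint.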

2. Platoon Interaction Models and State Representation

In reservation-based intersection management, each platoon corresponds to a collection of vehicles arriving on the same lane, with a designated “leader” responsible for communicating to the intersection manager a request packet with position, speed, acceleration, estimated conflict-zone arrival time, and platoon size. The manager maintains a “pool” consisting of at most one platoon per incoming lane (the closest within range) and updates reservations as the pool changes. Confirmations specify reserved entry times, admitting non-conflicting platoons to traverse simultaneously.
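The request packet and the one-platoon-per-lane pool described above might be represented as in the sketch below. All names (`PlatoonRequest`, `build_pool`, the field names, and the use of a distance in metres as the "position") are illustrative assumptions rather than the message format of the cited system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PlatoonRequest:
    """Hypothetical request sent by a platoon leader to the manager."""
    lane: str
    distance_m: float   # leader's distance to the conflict zone (assumed form of "position")
    speed_mps: float
    accel_mps2: float
    eta_s: float        # estimated conflict-zone arrival time
    size: int           # number of vehicles in the platoon


def build_pool(requests, comm_range_m):
    """Keep at most one platoon per lane: the closest one within range."""
    pool = {}
    for req in requests:
        if req.distance_m > comm_range_m:
            continue  # out of communication range, not considered yet
        best = pool.get(req.lane)
        if best is None or req.distance_m < best.distance_m:
            pool[req.lane] = req
    return pool
```

A manager would rebuild or update this pool whenever a platoon enters range or clears the intersection, then reschedule the platoons it contains.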

For stochastic platoon control, the platoon is modeled as a vector state $X^{\epsilon}(t) \in \mathbb{R}^n$ subject to randomly switching environments modeled as a continuous-time Markov chain $\Xi^{\epsilon}(t)$ on a large finite state space. The controlled dynamics take the Itô SDE form:

$dX^{\epsilon}(t) = A(\Xi^{\epsilon}(t)) X^{\epsilon}(t) dt + B(\Xi^{\epsilon}(t)) u(t) dt + \Sigma(\Xi^{\epsilon}(t)) dW(t)$

This formulation covers both aggregate traffic flow and multi-vehicle kinematics, allowing variance optimization across various platoon-level objectives (Yang et al., 2014).
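To make the dynamics concrete, the sketch below simulates one sample path of a scalar ($n=1$) version of this SDE with an Euler–Maruyama discretization. The discretization, the function name `simulate_switching_diffusion`, and the first-order approximation of the regime switches are assumptions chosen for illustration; the cited work analyzes the continuous-time model, not this scheme.

```python
import math
import random


def simulate_switching_diffusion(x0, a, b, sigma, q, u, T=1.0, steps=1000, seed=0):
    """Euler-Maruyama path of dX = a[k] X dt + b[k] u dt + sigma[k] dW.

    k is the state of a continuous-time Markov chain with generator q
    (a list of rows), started in state 0. u(t, x, k) is a feedback law.
    """
    rng = random.Random(seed)
    dt = T / steps
    x, k = x0, 0
    path = [x]
    for step in range(steps):
        t = step * dt
        # Leave regime k during [t, t+dt) with probability about -q[k][k]*dt.
        if rng.random() < -q[k][k] * dt:
            others = [j for j in range(len(q)) if j != k]
            rates = [q[k][j] for j in others]
            k = rng.choices(others, weights=rates)[0]
        dw = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment
        x = x + (a[k] * x + b[k] * u(t, x, k)) * dt + sigma[k] * dw
        path.append(x)
    return path


# Two regimes with different drift and noise, and no control input.
path = simulate_switching_diffusion(
    x0=1.0, a=[-0.5, -2.0], b=[1.0, 1.0], sigma=[0.1, 0.4],
    q=[[-2.0, 2.0], [1.0, -1.0]], u=lambda t, x, k: 0.0,
)
print(len(path))
```

The step size must be small relative to the switching rates for the regime-switch approximation to be reasonable, which becomes restrictive in the fast-switching regime.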

3. Algorithmic Strategies and Complexity

The intersection manager's scheduling policy employs a brute-force permutation search with simultaneous batching of non-conflicting platoons. For each feasible permutation, batches of non-conflicting platoons are formed, aggregate delays are computed, and the total cost $\delta_1$ is evaluated. The schedule with minimum $\delta_1$ is chosen. The number of candidate schedules, accounting for all possible simultaneous crossings, is

$T = \sum_{r=1}^{N} \sum_{i=0}^{r-1} (-1)^{i} \binom{r}{i} (r - i)^N$

which is $O(N^N)$ but tractable for small $N$ (e.g., $T=75$ for $N=4$). Disallowing simultaneous crossings reduces the candidate set to the $N!$ plain orderings (Bashiri et al., 2018).
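The count $T$ can be checked numerically. The sketch below evaluates the double sum directly and, as a cross-check, enumerates every way of splitting $N$ platoons into an ordered sequence of non-empty batches. The helper names are illustrative, and the enumeration ignores conflict feasibility, so it gives the size of the unconstrained search space only.

```python
from itertools import combinations
from math import comb, factorial


def count_schedules(n):
    """Evaluate T = sum_{r=1}^{n} sum_{i=0}^{r-1} (-1)^i C(r, i) (r - i)^n."""
    return sum(
        (-1) ** i * comb(r, i) * (r - i) ** n
        for r in range(1, n + 1)
        for i in range(r)
    )


def ordered_batches(items):
    """Yield each ordered sequence of non-empty batches covering items."""
    items = tuple(items)
    if not items:
        yield []
        return
    for size in range(1, len(items) + 1):
        for first in combinations(items, size):
            rest = [x for x in items if x not in first]
            for tail in ordered_batches(rest):
                yield [first] + tail


n = 4
print(count_schedules(n))                     # 75
print(len(list(ordered_batches(range(n)))))   # 75
print(factorial(n))                           # 24 orderings without batching
```

The rapid growth of `count_schedules(n)` with `n` is the reason the exhaustive search is practical only when the pool holds a handful of platoons.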

In stochastic control, direct solution of the coupled Riccati ODEs for all Markov states has $O(m n^3)$ complexity, with $m$ Markov states and an $n$-dimensional process. Aggregating the environmental states into $\ell \ll m$ clusters, one instead solves only $\ell$ coupled ODEs, one per cluster, for a computational speedup of roughly $m/\ell$ (Yang et al., 2014).

4. Safety, Constraints, and System Guarantees

Intersection-based PVM enforces geometric and temporal exclusivity: reservation policies prohibit simultaneous crossing of platoons with intersecting turn paths within the conflict zone. Pairwise tests for conflict are used in schedule feasibility checks, and, by construction, confirmed entry times never allocate conflicting platoons to overlapping intervals, guaranteeing collision-free operation (Bashiri et al., 2018).
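A pairwise feasibility check of this kind might look like the following sketch. The representation of a reservation as a `(movement, t_enter, t_exit)` tuple and of the conflict relation as a set of movement pairs are assumptions made for illustration; the cited system defines conflicts from the intersection geometry.

```python
from itertools import combinations


def is_conflict_free(reservations, conflicts):
    """Return True if no two conflicting movements overlap in time.

    reservations: list of (movement, t_enter, t_exit) tuples.
    conflicts: set of frozenset({m1, m2}) pairs whose paths intersect.
    """
    for (m1, s1, e1), (m2, s2, e2) in combinations(reservations, 2):
        overlap = s1 < e2 and s2 < e1  # half-open intervals [s, e)
        if overlap and frozenset((m1, m2)) in conflicts:
            return False
    return True


conflicts = {frozenset(("NS_through", "EW_through"))}
ok = is_conflict_free(
    [("NS_through", 0.0, 5.0), ("SN_through", 0.0, 5.0), ("EW_through", 5.0, 9.0)],
    conflicts,
)
print(ok)
```

Non-conflicting movements may share an interval, which is what allows simultaneous crossings, while conflicting ones must occupy disjoint intervals.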

In switching diffusion models, constraints focus on system well-posedness, uniform boundedness, non-degeneracy of noise, and existence of weak solutions to the stochastic control problem. Near-optimality of the aggregated (clustered) control policy is guaranteed by weak convergence to the limit system as the fast-switching parameter $\epsilon$ approaches zero (Yang et al., 2014).

5. Aggregation, Reduced-order Control, and Weak Convergence

For high-dimensional, rapidly switching environments, aggregation is performed by partitioning the Markov states into clusters and treating each cluster as a “superstate.” The fast intra-cluster transitions ($Q^{\text{fast}}$) and slow inter-cluster transitions ($Q^{\text{slow}}$) generate an aggregated Markov process. As $\epsilon \to 0$, the joint process converges weakly to a simpler limit, with dynamics and control governed by cluster-averaged coefficients:

$dX(t) = \bar{A}(\alpha(t)) X(t) dt + \bar{B}(\alpha(t)) U(t) dt + \bar{\Sigma}(\alpha(t)) dW(t)$

The optimal control law for the limit system is

$U^*(t) = -[\bar{\Sigma}(\alpha(t)) \bar{\Sigma}(\alpha(t))']^{-1} \bar{B}(\alpha(t))' [X(t) + (\lambda-z)\bar{H}(t,\alpha(t))]$

with the control “lifted” back to the original system by matching each fine-grained state to its cluster (Yang et al., 2014).
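The cluster averaging can be illustrated with a small sketch. It assumes, as is standard in two-time-scale Markov chain analysis but not spelled out above, that each averaged drift coefficient is the mean of the fine-grained coefficients under the stationary distribution of the fast chain within the cluster. The function names and numbers are illustrative, and only a scalar drift coefficient is averaged here.

```python
def stationary_distribution(q, iters=1000):
    """Stationary distribution of an irreducible generator q (list of rows).

    Uses power iteration on the uniformized transition matrix I + q/c.
    """
    n = len(q)
    c = 2.0 * max(-q[i][i] for i in range(n))  # rate above every exit rate
    p = [[(1.0 if i == j else 0.0) + q[i][j] / c for j in range(n)]
         for i in range(n)]
    nu = [1.0 / n] * n
    for _ in range(iters):
        nu = [sum(nu[i] * p[i][j] for i in range(n)) for j in range(n)]
    return nu


def cluster_average(nu, coeffs):
    """Average fine-grained coefficients with weights nu."""
    return sum(w * a for w, a in zip(nu, coeffs))


# One cluster with two fast-switching states (made-up rates and drifts).
q_fast = [[-1.0, 1.0], [3.0, -3.0]]
nu = stationary_distribution(q_fast)
a_bar = cluster_average(nu, [-1.0, -3.0])
print(nu, a_bar)
```

Here the fast chain spends about three quarters of its time in the first state, so the averaged drift sits closer to that state's coefficient.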

6. Empirical and Simulation Results

Intersection-based simulations (traffic flows of 500–800 veh/h/lane, maximum platoon sizes of 1–5) show that PVM scheduling achieves an average delay of 6.56 s versus 43.26 s for a standard 4-phase traffic light (about 6.6× lower), a delay standard deviation of 6.37 s versus 31.44 s (about 4.9× lower), throughput of 1,617 versus 1,388 veh/h (+16.5% from these figures), and fuel consumption of 77 versus 84 ml/vehicle (about -8%). In all tested scenarios, PVM outperformed traditional intersection control on average delay, delay variance, throughput, and fuel consumption (Bashiri et al., 2018).

In mean–variance switching diffusion, simulations of clustered platoon control for highway contexts (e.g., $n=1$, $m=4$ states, $2$ clusters) show convergence of the reduced-order control as $\epsilon \to 0$, with terminal state means and variances approaching their aggregated values and CPU time dropping by a factor equal to the state-reduction ratio (Yang et al., 2014).

Metric	4-Phase Light	PVM $(\delta_1)$	Relative Improvement
Avg Delay [s]	43.26	6.56	6.6× lower
Delay Std. Dev. [s]	31.44	6.37	4.9× lower
Throughput [veh/h]	1388	1617	+16.5%
Fuel Consumption [ml/veh]	84	77	-8%

7. Extensions and Practical Implications

PVM principles generalize readily to larger intersections and platoon networks, provided the combinatorial scheduling complexity or high-dimensional control can be reduced via heuristics or aggregation. In traffic applications, PVM frameworks are suited to real-time embedded deployment for intelligent intersection management and can accommodate lane-level communication, conflict-geometry definitions, and reservation updates.

In stochastic networks, the mean–variance control structure, together with its justification via aggregation (clustering) and weak convergence, keeps PVM approaches tractable even for large infrastructure systems with complex environmental drivers. A plausible implication, consistent with the simulation and theoretical results cited above, is that variance-minimizing platoon control can remain robust and scalable across both micro- and macro-scale transportation domains (Bashiri et al., 2018; Yang et al., 2014).


References

Bashiri et al. (2018). PAIM: Platoon-based Autonomous Intersection Management.

Yang et al. (2014). A Mean-Variance Control Framework for Platoon Control Problems: Weak Convergence Results and Applications on Reduction of Complexity.
