Papers
Topics
Authors
Recent
Search
2000 character limit reached

Trip-Based Sampling Optimization

Updated 14 January 2026
  • The paper presents a novel optimization framework that leverages timetabled bus trips and trip chains to maximize spatial-temporal sensor coverage in urban settings.
  • It employs a sequential three-stage formulation—including bus-line pre-selection, minimum-fleet sizing, and sensor allocation—to efficiently manage computational complexity while ensuring high coverage.
  • The joint bi-level formulation co-optimizes scheduling and sensor placement, reducing sensor requirements by up to 22% and substantially increasing grid–time coverage.

Trip-based sampling is an optimization framework for the deployment of a limited number of mobile sensors on fleet buses, aiming to maximize spatial-temporal coverage for drive-by sensing tasks (such as air quality, traffic state, and road roughness monitoring). The methodology explicitly incorporates timetabled bus trips, exploits the structure of trip chains (ordered sequences of trips served by the same bus), and reconciles operational constraints on minimal fleet size with coverage maximization, all while maintaining computational tractability at city scale (Ji et al., 2023).

1. Problem Setting and Definitions

The spatial domain is discretized into grids gGg \in G (e.g., 1km×1km1\,{\rm km} \times 1\,{\rm km}), and time is segmented into intervals tTt \in T of fixed length Δ\Delta (e.g., 60 min). The bus network consists of L|\mathcal{L}| lines, each with a fixed timetable. A trip iIli\in I_l on line ll is specified as (pi,qi,tli,τi)(p^i,q^i,t_{li},\tau_i): start/end terminals, scheduled departure time, and duration. Dead-heading time tlijt_{lij} defines the non-service interval between consecutive trips i,ji,j by the same vehicle.

A trip chain cc is a feasible, time-respecting sequence of timetabled trips a physical bus can serve in a day. Chains are feasible only if the dead-heading constraints are satisfied: tli+τi+tlijtljt_{li}+\tau_i+t_{lij} \leq t_{lj} for all transitions in the chain. Up to NSN_S identical sensors may be installed, each assigned to a trip chain (i.e., bus), conferring sensing ability for the whole chain.

Coverage is defined at the grid–time pair (g,t)(g,t) level: ngt=1n_{gt}=1 if at least one sensor-equipped bus is in grid gg during tt, zero otherwise. Spatial (wgw_g) and temporal (μt\mu_t) weights (normalized to sum to 1) model heterogeneous monitoring priorities. The global coverage (sensing reward) is

Φ=gGwgtTμtngt.\Phi = \sum_{g\in G} w_g \sum_{t\in T} \mu_t n_{gt}.

Operational constraints include complete fulfillment of the timetable with the minimal fleet size (minimum-fleet principle), ensuring sensor assignment does not compromise service.

2. Sequential Three-Stage Formulation

Stage A: Bus-Line Pre-Selection

To reduce problem size, a set cover is solved to select a minimal subset LLL\subset\mathcal{L} of lines covering at least a fraction γ\gamma of all reachable grids (with γ=1\gamma=1 yielding full coverage). Let δgl=1\delta_{gl}=1 if line ll covers grid gg. The binary program minimizes lxl\sum_{l}x_l subject to constraints ensuring sufficient grid coverage and logical consistency.

Stage B: Minimum-Fleet Sizing per Line

For each selected line lLl\in L, a bipartite matching is solved to minimize the number of buses required while chaining trips into feasible sequences. Variables ylijy_{lij} indicate whether trip jj is served immediately after ii. The minimum fleet for line ll is Nlmin=NIlmaxyi,jylijN_l^{\rm min}=N_{I_l}-\max_{y}\sum_{i,j}y_{lij}, with NIlN_{I_l} the number of trips on ll. Matched pairs are extracted to form all trip chains C=lClC=\cup_l C_l.

Stage C: Sensor Allocation to Trip Chains

Sensor assignment is phrased as a 0-1 integer program over all trip chains. Binary variable zcz_c flags instrumented chains. For each trip and grid–time pair, indicator nigtn_{igt} marks if trip ii covers (g,t)(g,t). Constraints ensure no more than NSN_S sensors are assigned, and that every covered grid–time pair is supported by at least one equipped bus.

These distinct stages—pre-selection, fleet sizing, sensor allocation—frame the trip-based sampling approach as a sequence of linked optimization problems.

3. Joint Bi-level Formulation

The joint bi-level model addresses the sub-optimality arising from fixing trip chains in advance, instead co-optimizing scheduling and sensing assignments per line.

  • Upper Level: Across all lines, integer variables mlm_l distribute the available NSN_S sensors, maximizing total coverage by blending information on how many sensors to assign per line (subject to per-line saturation KlK_l).
  • Lower Level (per line): For a given mlm_l, the problem is to select mlm_l trip chains to be instrumented, optimizing the coverage contributed by that line. Variables (zc,ξcij,ξci,ngt)(z_c,\xi_{cij},\xi_{ci},n_{gt}) model which chains and trip transitions are chosen, and their resulting sensing impact.

The bi-level structure is separable by line, allowing parallel solution, with sensor allocation at the upper level guided by lower-level computations of attainable coverage for each mlm_l.

The two levels interact only through the mappings mlqgt(l,ml)m_l \leftrightarrow q_{gt}^{(l, m_l)}, with qgt(l,m)q_{gt}^{(l, m)} denoting grid–time coverage from line ll equipped with mm sensors.

4. Algorithmic Workflow and Computational Properties

The algorithm proceeds as follows:

  1. Line Pre-Selection: The set cover step significantly reduces the problem size, selecting L\ll |\mathcal{L}| relevant lines.
  2. Per-Line Optimization: For each chosen line,
    • The fleet sizing (bipartite matching) is solved in O(Il2.5)O(|I_l|^{2.5}) time (max-flow/assignment).
    • Model reduction prunes superfluous link variables ξcij\xi_{cij} where idle times exceed a threshold δ\delta, preserving optimal fleet size and saving up to 90% in problem dimensionality.
    • For m=0,1,2,m=0,1,2,\dots, the pruned mixed-integer program is solved to find Φl(m)\Phi_l(m) and associated coverage. Computation stops when further sensors do not increase coverage (at saturation KlK_l).
  3. Global Sensor Allocation: The upper-level knapsack-like integer program (in L|L| variables) allocates NSN_S sensors to lines.

Each line's lower-level problem is independent, and the reduced L|L| after pre-selection enables sub-linear scaling in L|\mathcal{L}|. In contrast, a naïve vehicle-based approach is combinatorial in the total number of buses or trip chains.

5. Empirical Study: Chengdu Case

A comprehensive real-world test covers 400km2400\,{\rm km}^2 within Chengdu’s 4th Ring Road, with $400$ one-kilometer grids and service from 77\,am to 1010\,pm. Three temporal granularities (Δ=60,90,120\Delta=60,\,90,\,120 min) are examined, μt=1/T\mu_t=1/|T|. Spatial weights wgw_g are derived from traffic and emission data.

Of $167$ bus lines, pre-selection (with γ=1\gamma=1) yields L=38L=38 lines ensuring full grid coverage. These require a minimum fleet of $684$ buses for $6,006$ trips. To achieve 90%90\% coverage of grid–time pairs at Δ=60\Delta=60 min, the sequential approach requires $49$ sensors; the joint bi-level model needs only $38$ (a reduction of 22%22\%). The number of grids fully covered in every interval increases by $41$–238%238\% under the joint model. Almost every line saturates at Kl2K_l \leq 2 sensors for Δ=60\Delta=60 min, and Kl=1K_l = 1 for Δ90\Delta \geq 90 min.

Computation times are significantly improved after pre-selection: 1,313s1,313\,s for fleet-sizing on all $167$ lines versus 90s90\,s on $38$ lines; sensor allocation MILPs take minutes instead of 12h12\,h. Pruning with δ100\delta \approx 100 min (idle time) reduces solution time by $25$–60%60\% without degrading coverage.

Aspect Sequential Approach Joint Bi-level Approach
Sensors for 90% cover 49 38
Increase in 100% grids Baseline +41–238%
Saturation per line Kl2K_l\leq2 (60 min) Kl2K_l\leq2 (60 min)

6. Model Extensions and Practical Recommendations

Multiple model extensions are available for operational realism:

  • Service gaps: Dummy trips DlD_l with fixed time windows (e.g., for breaks or charging) can be inserted, with chain assignment constraints.
  • Bus relocations: Forbidden by taking tlij=t_{lij}=\infty or penalized with a multi-objective cost term (βtlijξcij\beta t_{lij} \xi_{cij}).
  • Operational costs: Additional terms for total fleet size (αNlmin\alpha N_l^{\min}) and dead-heading (βtlijylij\sum \beta t_{lij} y_{lij} or βtlijξcij\sum \beta t_{lij} \xi_{cij}).
  • Uncertain service times/speeds: Addressable through robust or stochastic variants, or corrected via subsequent data processing.

A practical rule of thumb is to assign one sensor per selected line and prioritize a second sensor to lines with large one-way trip durations, to close temporal coverage gaps for coarse Δ\Delta.

The trip-based methodology thus tightly integrates the combinatorics of fleet scheduling with the needs of optimal spatial-temporal sensor allocation. It achieves near-optimal city-scale coverage under realistic operational constraints and computational budgets, with the decoupling by lines ensuring both tractability and deployment feasibility (Ji et al., 2023).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Trip-Based Sampling.