Papers
Topics
Authors
Recent
Search
2000 character limit reached

Dominator-Based MILP Simplification Framework

Updated 1 December 2025
  • The paper presents a dominator-tree strategy that fixes variables and reduces constraints in MILP formulations for flow decomposition, achieving significant computational speed-ups.
  • The approach uses safe sequence identification, collapsing maximal univocal paths to generalize flow decomposition from DAGs to cyclic graphs.
  • The method integrates linear-time preprocessing with MILP formulation, streamlining constraints and variables and yielding dramatic runtime improvements on biological datasets.

The dominator-based MILP simplification framework addresses flow decomposition problems on general (possibly cyclic) directed graphs via a graph-theoretic technique built on dominator trees. It enables fast and flexible Mixed Integer Linear Programming (MILP) formulations for decomposing flows into walks or paths. Central to the approach is the identification and exploitation of "safe sequences" of edges—those that must necessarily appear as subsequences in any walk cover—allowing for substantial MILP simplification through variable fixing and constraint reduction. This methodology generalizes previous work limited to directed acyclic graphs (DAGs) and is validated with significant computational speed-ups on biological datasets (Sena et al., 24 Nov 2025).

1. Graph-Theoretic Foundations and Dominator Trees

Let G=(V,E)G=(V,E) denote a directed graph (possibly with cycles) with distinguished source ss and sink tt, referred to as an s–t graph. The notion of domination is defined as follows:

  • uu s-dominates vv (denoted usvu \preceq_s v) if every ssvv walk passes through uu.
  • uu t-dominates vv (denoted utvu \preceq_t v) if every vvtt walk passes through uu.

The immediate s-dominator of vv (vsv\neq s) is the unique strict s-dominator of vv minimal under the domination ordering. The s-dominator tree TsT_s organizes VV such that parent(v)=idoms(v)\operatorname{parent}(v)=\operatorname{idom}_s(v) and is rooted at ss. Analogously, the t-dominator tree TtT_t is defined with roles reversed for tt.

These dominator trees capture the structural constraints on walks traversing GG from ss to tt, serving as the basis for safe sequence identification and leveraging in flow decomposition MILPs.

2. Safe Sequences: Characterization and Structural Theorems

Given a collection CEC \subseteq E of edges to be covered, an s–t walk cover is a set of s–t walks such that every edge eCe\in C is present in at least one walk.

A sequence X=(e1,,e)X = (e_1, \ldots, e_\ell) of edges is C-safe if in every s–t walk cover of CC, at least one walk contains XX as a subsequence. The connection to dominator trees is formalized by defining, for each vVv \in V:

extension(v):=(path in Ts from sv)(path in Tt from vt)\operatorname{extension}(v) := (\text{path in } T_s \text{ from } s \to v) \circ (\text{path in } T_t \text{ from } v \to t)

Theorem: A sequence XX of edges is CC-safe if and only if there exists vVv \in V with XX a subsequence of extension(v)\operatorname{extension}(v). Every maximal safe sequence is exactly extension(v)\operatorname{extension}(v) for some vv, where vv is a common leaf of the (possibly collapsed) TsT_s and TtT_t.

To address long chains of non-branching dominators, maximal "univocal" paths—paths shared by both TsT_s and TtT_t without branching—are collapsed into single super-vertices. In the collapsed trees, the set of maximal safe sequences corresponds exactly to the set of extensions of common leaves.

The number of maximal safe sequences is O(n)O(n) and the total output size is o=vextension(v)o = \sum_v |\operatorname{extension}(v)|.

3. Enumeration of Maximal Safe Sequences in Linear Time

All maximal safe sequences can be enumerated exactly once in O(m+o)O(m + o) time given an s–t graph G=(V,E)G=(V,E) and subset CEC\subseteq E.

Procedure:

  1. Compute TsT_s (dominator tree rooted at ss) using the Lengauer–Tarjan algorithm in O(m+n)O(m+n).
  2. Compute TtT_t (dominator tree rooted at tt on the edge-reversed graph, then reverse back).
  3. For both TsT_s and TtT_t, mark all vertices present on extension(w)\operatorname{extension}(w) for some ww covering an edge in CC.
  4. Collapse maximal univocal paths in each tree to single nodes, maintaining the path sequences.
  5. For each vertex vv that is simultaneously a leaf in both collapsed trees, reconstruct extension(v)\operatorname{extension}(v) by concatenating the stored sequences and output to LL.

The process is dominated by the sum total length oo of all safe sequences, in addition to graph traversal and tree operations in O(m+n)O(m+n). The overall complexity is O(m+o)O(m+o) (Sena et al., 24 Nov 2025).

4. Integration with MILP Flow Decomposition Models

The dominator-based approach simplifies and accelerates MILP models for flow decomposition into kk walks.

4.1 Standard (Unsimplified) MILP

For minimum-flow-decomposition (k-FD), the unsimplified MILP consists of:

  • Variables: xuv,ix_{uv,i} (number of times walk ii traverses (u,v)(u,v)), yuv,iy_{uv,i} (reachability helper variables), dv,id_{v,i} (depth labels), wiw_i (walk weights), and slack/error variables as appropriate.
  • Constraints: flow conservation, tree selection (yuv,ixuv,iy_{uv,i} \leq x_{uv,i}), vertex in-degree via yy variables, acyclicity/depth, and bilinear flow matching.

4.2 Safety-Based Preprocessing

Given maximal safe sequences S1,,StS_1, \ldots, S_t, forming a maximum-weight antichain:

  • For iti \leq t, assign SiS_i to walk ii without loss of generality.
  • For every edge e=(u,v)Sie=(u,v) \in S_i:
    • If ee lies between distinct SCCs, set xuv,i=1x_{uv,i} = 1 (no repetitions).
    • Otherwise, set xuv,iSi(e)x_{uv,i} \geq S_i(e) (as determined by sequence multiplicity).
  • For every incompatible edge (u,v)(u',v'), enforce xuv,i=0x_{u'v',i}=0 except when reachability conditions (specified in three cases) are met; these checks require only two BFS/DFS traversals.

The antichain is computed on the condensation DAG, reducing to a max-flow problem in O(m)O(m). The result is that many xx-variables are fixed and others set to zero, shrinking the MILP feasible region and simplifying constraint structure.

4.3 Reduced MILP and Complexity

After safety-based preprocessing:

  • Many variables and constraints (including most bilinear products and reachability/depth constraints) are eliminated for walks iti \leq t.
  • The simplified MILP for k-FD includes only flow-conservation, forced traversals, and edge elimination based on safe sequences for corresponding walks, plus residual variables for the remaining unfixed walks.
  • Preprocessing, including safe sequences and assignment, is O(m+o)O(m+o), typically negligible compared to MILP solution times.

5. Empirical Speed-Ups and Practical Performance

Using four bacterial assembly datasets and three flow-decomposition objectives (Minimum Flow Decomposition, Least Absolute Errors, Minimum Path Error), dominator-based preprocessing yields dramatic computational improvements:

  • Up to 393×393\times faster on Minimum Flow Decomposition,
  • Up to 560×560\times faster on Least Absolute Errors,
  • Up to 1465×1465\times faster on Minimum Path Error.

Instances previously timing out at $300$ seconds are solved in under $30$ seconds.

This demonstrates that dominator-tree safe-sequence fixing is an effective linear-time preprocessing technique. It drastically reduces MILP search spaces and avoids bilinear encodings, all while maintaining solution exactness on general (cyclic) graphs (Sena et al., 24 Nov 2025).

6. Applications and Broader Implications

The dominator-based MILP simplification framework provides robust algorithmic tools for multi-assembly problems and general flow decomposition tasks in graph analysis. Its flexibility with cycles, provable model-size reductions, and empirical acceleration mark it as a foundational building block for future multi-assembly applications. This suggests that dominator-driven approaches could generalize to other combinatorial optimization problems involving path- or walk-covers in complex networks. A plausible implication is that further dominator-theoretic simplifications could be discovered for other MILP-based graph inference problems.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Dominator-Based MILP Simplification Framework.