Timing-Anomaly-Free Scheduling
- The paper introduces the Deterministic Dynamic Execution (DDE) algorithm, which eliminates timing anomalies by enforcing fixed resource allocation and strict execution order.
- It uses a single offline simulation to safely estimate tight worst-case response times while reducing pessimism and computational complexity.
- Experimental results demonstrate up to a 72% reduction in worst-case response time and a 7–9% decrease in jitter, enhancing predictability in dynamic scheduling.
A timing-anomaly-free dynamic scheduling algorithm is a scheduling technique developed for heterogeneous systems that eliminates timing anomalies—situations where local reductions in task execution time paradoxically result in increased global execution time—by enforcing deterministic constraints on resource allocation and execution order. The Deterministic Dynamic Execution (DDE) algorithm is the first such approach for heterogeneous systems, supporting both safe and tight worst-case response time (WCRT) analysis through a single offline simulation, while reducing pessimism and complexity relative to traditional scheduling strategies (Zhu et al., 28 Jan 2026).
1. System and Task Model
The DDE algorithm is defined for heterogeneous platforms composed of a set of processing-unit instances:
Each instance is specified by its architecture (e.g., CPU, GPU), micro-type (e.g., high-performance CPU versus low-power CPU), and index , yielding the notation .
Tasksets are represented as multi-typed DAGs. A DAG consists of:
- : finite set of tasks (nodes)
- : precedence constraints (edges)
- : maps each task to its eligible processor-unit types
- : gives execution-time intervals as for best-case and worst-case execution times
The response time of a task , , is the time from the DAG source until ’s finish time , with worst-case response time for a scheduler defined as:
where is the set of legal schedules under .
2. Timing Anomalies in Dynamic Scheduling
Timing anomalies occur in dynamic scheduling when local improvements—such as reductions in execution time for individual tasks—lead to increased global response times, either by altering resource contention or task dispatch order.
Formal Model: System execution is represented as a finite-state machine mapping each to a progress tuple . A strict timing anomaly is present if, for two reachable states :
- There exists a non-empty subset such that all , is strictly less progressed than , while for all other tasks
- Future execution from is at least as large as from for tasks not in execution
- Yet, after some number of steps , the state derived from is no longer ahead of that from
Illustrative Example: Under the HBFS policy, a local reduction in a task’s execution time may shift resource contention, indirectly delaying the critical chain and resulting in a higher overall completion time than the case with no reduction [(Zhu et al., 28 Jan 2026), see Fig. 3].
3. Deterministic Dynamic Execution Algorithm
DDE enforces two primary constraints at runtime to prevent timing anomalies:
Constraint 1: Resource-Allocation Determinism
Each task is fixed in advance to execute on a predetermined processor type .
Constraint 2: Execution-Order Determinism
Tasks are ordered offline into a total order that respects DAG precedence. No task may begin execution before all its predecessors in have reached the "started" state.
DDE Execution-Progress Model
- The system state , with initial state set to , except for the source node which starts at .
- The transition function updates progress of each task by checking dependency completion, resource availability, and advancing time.
Algorithm Sketch
At each time tick:
- Add newly ready tasks to ReadyQ.
- Select the ReadyQ task with minimum index in .
- If all its predecessors have started and a resource of type is available, dispatch and mark as started.
- Advance one tick, decrement execution counters, and update stages.
Computational complexity is , with the simulated WCRT and the number of compute units.
4. WCRT Estimation via Offline Simulation
Under DDE, WCRT is determined by a single offline simulation where each task is allocated its WCET on the assigned resource. Due to monotonicity of execution progress under the DDE constraints, the simulated response time bounds all real executions:
This simulation avoids the need for exhaustive state-space exploration ordinarily required when timing anomalies are possible.
5. Generation of Execution Constraints
Constraints for DDE are generated by two methods.
(a) Trace-Based Extraction: Run the baseline scheduler (e.g., HBFS) offline on the DAG with all tasks at their WCETs. Extract, for each task , its start and finish times and resource type. Sort tasks by start time (ties broken by ID) to derive , and fix to the corresponding instance type.
(b) Heuristic HACPA Method: Assign ranks bottom-up as
Tasks are processed in descending order of rank; each receives the processor type and resource minimizing its finish time under WCET and dependency constraints. The resulting trace yields and .
6. Formal Guarantees and Anomaly-Freeness
Monotonicity Lemmas:
- Every state transition advances at least one task’s progress.
- Dependency completions are preserved under progress ordering.
- Advancement in any task's progress in a "smaller" state remains at least as advanced in a "larger" state after a transition.
Strict-TA-Free Theorem (Theorem 3): Under the two DDE constraints, for all relevant state pairs and all , the simulation from a state strictly ahead in progress remains (at least) as advanced at every later step. This ensures absence of both strict and generic timing anomalies, so the simulated WCRT is both safe (upper-bounding real execution) and tight.
7. Experimental Evaluation and Results
Tasksets and Configuration
- Random task DAGs of size with density
- Each DAG given 100 configurations: 4 processor types (CPU, CPU, GPU, GPU)
- Node execution intervals: 80% assigned heavy (WCET 10–30 BCET), 20% light (WCET 1–1.2 BCET)
- Resource availability: 1, 2, or 4 instances per type
Metrics
| Metric | Definition |
|---|---|
| MSWCRT | Maximum observed RT over 10,000 random runs under scheduler |
| WCRT | Offline simulated all-WCET bound under scheduler |
| AVRT | Average response time |
| jitter |
Results
- Probability of timing anomaly: up to 45% under HFCFS for sparse graphs; 3% for HBFS.
- DDE eliminates all observed timing anomalies: .
- DDE reduces WCRT by average 5–25% compared to baseline MSWCRT, with best-case reduction of 72%. HACPA variant provides an additional 2–5% reduction.
- Jitter reduced by 7–9% on average.
- Tradeoff in AVRT: increase by 4.2% overall (from including non-TA cases), while AVRT decreases by 1.2% for TA-exhibiting systems (best case reduction −40%).
8. Significance and Implications
Deterministic Dynamic Execution establishes the first provably strict timing-anomaly-free algorithm for dynamic scheduling on heterogeneous systems. By imposing lightweight, offline-derived deterministic constraints, it converts a challenging real-time analysis problem—complicated by non-monotonic progress of dynamic schedules—into a tractable single simulation. The approach demonstrably eliminates timing anomalies, allows safe and tight WCRT estimation, reduces worst-case and jitter metrics, and does so with minimal negative impact on average-case performance. A plausible implication is that DDE provides a scalable foundation for predictable execution in safety-critical and hard real-time applications deployed on modern heterogeneous compute platforms (Zhu et al., 28 Jan 2026).