SEQC: Stratify-Elaborate Quantum Compiler
- SEQC is a hierarchical, parallelized quantum compiler framework designed for scalable chiplet-based quantum computing.
- It employs a two-stage stratification-elaboration process to partition large circuits into chiplet-local subcircuits, reducing optimization complexity significantly.
- The framework leverages parallelized simulated annealing and SABRE-inspired heuristics to enhance routing fidelity and compile speed by up to 7× compared to traditional methods.
The Stratify-Elaborate Quantum Compiler (SEQC) is a hierarchical, parallelized compilation framework designed for chiplet-based quantum architectures. As the field transitions toward modular quantum devices composed of multiple interconnected chiplets, SEQC addresses the critical scalability challenge posed by heterogeneous inter-chiplet connectivity, highly variable gate fidelities, and the need for efficient cross-chiplet allocation and routing. The compiler decomposes large circuits into chiplet-local subcircuits, compiles them in parallel, and optimizes both for end-to-end fidelity and compilation throughput, establishing itself as an effective strategy for scalable modular quantum computation (Jeng et al., 14 Jan 2025).
1. Hierarchical Pipeline Structure
SEQC divides the compilation process into two distinct stages optimized for chiplet-based hardware:
- Stratification (one-time per device topology): An input -qubit circuit is partitioned into a set of subcircuits , each mapped to at most qubits, where is the chiplet size. The partition aims to minimize future inter-chiplet communication by grouping densely interacting qubits.
- Elaboration (per compilation): Each subcircuit is compiled independently onto its assigned chiplet. The process encompasses qubit placement and routing, basis translation into native gate sets, and localized, parallel circuit-level optimizations.
This stratified approach confines high-cost optimization steps to chiplet-local subcircuits, reducing the per-chiplet complexity from to , which considerably accelerates compilation for large-scale designs.
2. Qubit Assignment and Partitioning Algorithms
During stratification, SEQC employs a parallelized simulated-annealing heuristic to partition logical qubits into subcircuits:
- Cost Graph Construction: Nodes represent logical qubits; edges connect weighted by the count of two-qubit gates between them.
- Partition Objective:
- Multiple independent annealing trials run in parallel. Each annealing step swaps qubits between blocks if it reduces . The minimum-cost solution is selected.
Subcircuit-to-chiplet assignment then adapts SABRE-like heuristics for the chiplet supergraph, introducing three classes of SWAP operations: symbiotic (benefits two cross-chiplet edges), commensalistic (helps one without harming others), and parasitic (adverse effect). Candidate layouts are ranked by fidelity-weighted shortest paths, giving preference to high-fidelity, low-latency links between chiplets.
3. Subcircuit Compilation and Optimization
Once mapping is fixed, each subcircuit undergoes an elaboration pipeline:
- Placement: Selects an injective mapping maximizing an estimated gate fidelity via SABRE-style lookahead:
- Routing: For non-adjacent two-qubit gates, inserts a minimal-weight path of SWAPs, with cost per edge , balancing fidelity and latency .
- Basis Translation: Decomposes logical gates to the chiplet’s native gate set using polynomial-time algorithms and caches for small unitaries.
- Circuit Optimization: Commutation-and-cancellation is applied in parallel within each chiplet subcircuit. Due to strict partitioning, no cross-chiplet optimizations are needed post-stratification.
4. Handling Inter-Chiplet Links
SEQC differentiates between intra-chiplet and inter-chiplet connections, modeling them as distinct edge sets:
- Resource Heterogeneity: Typically, intra-chiplet gates demonstrate higher fidelity () and lower duration ().
- Pre-stratification Routing: Gates necessitating cross-chiplet communication are assigned to “halo” qubits at the chiplet periphery to minimize interaction range.
- Immutable Cross-Chiplet SWAPs: Inter-chiplet routes established during stratification remain fixed during elaboration; all subsequent placement/routing cost functions respect the true and values to discourage unnecessary high-cost inter-chiplet operations.
5. Performance Metrics and Experimental Results
SEQC benchmarking utilizes circuits from the Supermarq suite (BitCode, PhaseCode, GHZ, VQE, Hamiltonian-Simulation), scaled up to 1000 qubits. The chiplet model assumes a grid of heavy-hexagon 10-qubit chiplets with the following native parameters: intra-chiplet CZ fidelity 99.4%, duration 34 ns; inter-chiplet SWAP fidelity 89.8%, duration 1400 ns.
| Metric | SEQC Result | Comparison |
|---|---|---|
| Avg. circuit fidelity gain | +9.3% (up to +49.99%) | Qiskit “peephole” baseline |
| Avg. compile speedup | (max ) | Chiplet-unaware Qiskit |
| Execution time per shot | lower | |
| Stratification cost | baseline single-run | Zero overhead on reuse |
A Qiskit 1.2.4 “peephole” routine, modified for chiplet-unaware baseline comparison, introduces SWAPs post hoc, but does not support fidelity- or latency-aware optimization.
6. Parallelization and Scalability
SEQC exploits parallelism at two tiers:
- Stratification stage: Each annealing chain is processed in a separate OS thread/process, yielding a pool of independent solutions.
- Elaboration stage: Subcircuits are mapped and optimized independently, enabling chiplet-level parallel compilation. For chiplets of size , this reduces overall complexity from to .
- Resulting speedups are empirically observed between and , with elaboration dominating efficiency gains in large circuits.
7. Flexibility and Hyperparameter Tuning
SEQC hyperparameters and objective weights can be tuned for various topologies and device constraints:
- The qubit-to-subcircuit annealer’s schedule and weights are retargetable to accommodate different chiplet numbers and non-uniform or evolving hardware graphs.
- Routing cost weights enable balancing between minimal latency and maximum circuit fidelity.
- For exotic modular topologies, including those with nonuniform or routed interconnects, adjusting the edge weights in the physical chiplet graph is sufficient; the two-stage logic and fidelity-weighted heuristics extend without modification.
The hierarchical separation of stratification and elaboration, along with chiplet-aware routing, enables highly scalable and hardware-optimized compilation that can adapt to the requirements and constraints of next-generation modular quantum devices (Jeng et al., 14 Jan 2025).