Lightweight Simulated Bifurcation
- Lightweight Simulated Bifurcation is a combinatorial optimization algorithm that simplifies the traditional SB method by removing nonlinear terms and quantizing state variables.
- It employs an attention-inspired, graph-topology initializer that reduces iteration counts by up to 80%, significantly accelerating convergence on NP-hard problems like Max-Cut.
- The algorithm is tightly integrated with FeFET-based compute-in-memory hardware, achieving notable speedups and improved solution quality in empirical evaluations.
The Lightweight Simulated Bifurcation Algorithm (light SB) is a hardware-informed, resource-efficient variant of the standard simulated bifurcation (SB) method for combinatorial optimization, specifically targeting large-scale Ising problems. Developed within a ferroelectric compute-in-memory (CiM) framework, light SB is designed to reconcile the algorithmic speed and solution quality of SB with the stringent efficiency and simplicity demands of in-memory hardware, notably FeFET-based crossbar arrays. Its core innovations are the removal of nonlinearities in the update equations and aggressive quantization of internal states, enabling high-throughput, low-latency operation directly on CiM platforms while preserving essential dynamics of the original SB approach (Qian et al., 19 Dec 2025).
1. Mathematical Formulation
The conventional SB algorithm encodes each Ising spin as continuous position and momentum vectors , updated at each discrete timestep as:
where is the Ising coupling matrix, is the time step, ramps up from $0$ to (an annealing parameter), are constants, and indicates element-wise cubic nonlinearity.
The light SB introduces two key simplifications:
- The cubic term is dropped () to eliminate hardware-unfriendly nonlinearities.
- Both and are quantized to ternary values: .
Update rules become:
with .
2. Algorithmic Workflow and Initialization
Light SB operates within a two-step framework:
- Attention-Inspired Graph-Topology Initialization: A domain-informed initialization is performed by calculating a “second-order score” for each node:
Here, , , and is the complement mask of ’s sparsity pattern. Each initial spin is set as
- Light SB Iteration: Initialized with and , the quantized update equations are run for steps, with linearly ramped as . The result is mapped back to .
This two-phase approach allows the iteration count to be reduced by up to 80% compared to conventional SB owing to the informative, topology-driven initialization.
3. Hardware Specialization: Compute-in-Memory Kernels
The light SB algorithm is tightly co-designed with FeFET-based CiM arrays. The main computational kernels are mapped directly to native operations of the hardware:
- Vector-Matrix Multiplication (VMM): is executed via FeFET threshold-programming per device. The ternary is applied along array rows/columns in two passes for and , extracting by current differencing.
- Vector-Matrix-Vector Product (VMV): The attention initializer’s is efficiently realized in a single pass by driving appropriate vectors along bit and source lines and integrating the resulting current.
- Threshold Programming and Reading: All cell writes utilize V, s pulses; readout is via fixed gate bias to yield stored bit.
4. Convergence Behavior and Complexity
Parameter selections follow prior art for SB: , , with uniquely to the light SB simplification. Both and ’s ternary quantization reduces each update to integer operations and a single VMM invocation.
Convergence is rapid on benchmark problems:
- For Max-Cut with up to , light SB converges in iterations, an 80% reduction from conventional SB due to the attention-based initializer.
- For a $32$-node Max-Cut, the algorithm completes in $20$ iterations ( ns) on hardware.
5. Empirical Performance
Measured on large Max-Cut graph instances (), key performance observations include:
| Metric | Gset (N ≤ 7,000) | Yset (N ≤ 105) |
|---|---|---|
| Time-to-solution speedup vs. GPU-SB | 34.5×–160.6× | 19.7×–175.9× |
| Solution quality improvement (max-cut) | +0.53%–6.54% | +0.56%–1.42% |
| Iteration count reduction (via init) | ~80% | ~80% |
| 32-node example: iteration/latency | 20 iterations/900 ns | — |
Attained solutions are consistently superior or comparable to those obtained by GPU-based SB, with both significant improvements in time-to-solution and nontrivial boosts in cut quality due to both algorithmic and hardware optimizations (Qian et al., 19 Dec 2025).
6. Integration and Significance
The light SB algorithm demonstrates that careful removal of nonlinearities and quantized state space, when accompanied by a graph-aware initializer, yields a solver that preserves the essential features of bifurcation-based Ising solvers in a form directly suited to CiM hardware. By mapping the necessary vector-matrix operations to FeFET crossbar primitives and optimizing initialization, it achieves substantial acceleration and quality improvements on NP-hard instances such as Max-Cut, simultaneously addressing central challenges facing in-memory Ising machines—rapid convergence, minimal iteration cost, and solution fidelity at hardware-scale (Qian et al., 19 Dec 2025).