Papers
Topics
Authors
Recent
Search
2000 character limit reached

Generative Adversarial Variational Quantum KAN

Updated 15 December 2025
  • The framework integrates variational quantum Kolmogorov-Arnold networks with adversarial training to deliver parameter-efficient image synthesis.
  • It employs trainable quantum circuits with spline-parameterized Ry rotations to achieve universal function decomposition and enhanced expressivity.
  • Experiments on MNIST and CIFAR-10 demonstrate competitive accuracy and sample diversity under low-data regimes with far fewer parameters than classical CNNs.

A Generative Adversarial Variational Quantum Kolmogorov–Arnold Network (GAVQKAN) is a quantum-enhanced adversarial generative modeling framework in which the generator is implemented as a variational quantum Kolmogorov–Arnold network (VQ-KAN) and the discriminator is typically classical, often a lightweight convolutional neural network. The approach leverages the Kolmogorov–Arnold representation for universal function decomposition and embeds it within a variational quantum circuit ansatz to achieve competitive sample quality with high parameter efficiency—outperforming neural network and standard quantum GAN baselines in limited-data regimes for tasks such as image synthesis on MNIST and CIFAR-10 (Wakaura, 11 Dec 2025).

1. Theoretical Foundation: Kolmogorov–Arnold Decomposition in Quantum Networks

The classical Kolmogorov–Arnold Network (KAN) is based on the theorem that any continuous multivariate function f:RnRf: \mathbb{R}^n \to \mathbb{R} can be expressed as

f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),

where ψjk\psi_{jk} and ϕk\phi_k are univariate nonlinear maps. In the VQ-KAN setting, both the inner (ψjk\psi_{jk}) and outer (ϕk\phi_k) maps are parametrized and realized implicitly by trainable quantum circuits ("quantum neurons"). Inputs zRnz \in \mathbb{R}^n are encoded into a quantum state via NqN_q single-qubit Ry rotations: Ψini(z)=j=0Nq1Ryj(zj)0Nq|\Psi_{\text{ini}}(z)\rangle = \bigotimes_{j=0}^{N_q-1} Ry_j(z_j)\,|0\rangle^{\otimes N_q} with zjz_j normalized to f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),0. Each VQ-KAN layer comprises f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),1 sub-layers, each applying a pattern of parametric Ry gates (with angles governed by spline-parameterizations, e.g., f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),2 at quantized grid points) and nearest-neighbor entangling CZ gates. The measurement yields a f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),3-element probability vector, which is downsampled or averaged for use in subsequent layers or as final output.

2. GAVQKAN GAN Integration: Architecture and Training

In GAVQKAN, the generator is one or more VQ-KAN blocks, each operating on a random latent vector f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),4 to produce a sequence of measured Born probabilities. For 32×32 (CIFAR-10) or 16×16 (MNIST) images, several VQ-KAN blocks are used in parallel to generate patches, which are stitched to reconstruct the overall image. Each block typically uses f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),5 layer, f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),6 sublayers (circuit depth), f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),7 qubits, 4 spline segments per input variable, and 8 input grid points per spline, leading to 256 trainable parameters per block. The measured probability vector is interpreted as a grayscale image patch. The discriminator is a classical CNN (typically 3 fully connected layers, e.g., [256, 32, 1] for a 16×16 patch) with ReLU activations and sigmoid normalization.

Training proceeds by alternating Stochastic Gradient Descent (SGD) steps on the generator and discriminator. Quantum gradients (for VQ-KAN circuit parameters) are computed with the parameter-shift rule: f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),8 as implemented in PennyLane (Wakaura, 11 Dec 2025).

3. Loss Functions and Optimization

The adversarial loss follows the standard Goodfellow GAN formulation: f(x)=k=12nϕk(j=1nψjk(xj)),f(x) = \sum_{k=1}^{2n} \phi_k \left( \sum_{j=1}^n \psi_{jk}(x_j) \right),9 specifically,

  • Discriminator: ψjk\psi_{jk}0
  • Generator: ψjk\psi_{jk}1

No explicit quantum-specific regularization is required; the adversarial signal suffices to train the VQ-KAN generator. Optimization in the benchmark is performed with SGD, using learning rates ψjk\psi_{jk}2, ψjk\psi_{jk}3, batch size 1, and up to 1000 iterations (MNIST) or 400 (CIFAR-10).

4. Implementation and Parameter Efficiency

The architecture realizes substantial parameter efficiency. A single VQ-KAN block uses ψjk\psi_{jk}4 parameters (e.g., ψjk\psi_{jk}5), versus ψjk\psi_{jk}6 for a classical CNN and 48 for an 8-qubit, depth-6 conventional QGAN (without KAN structure). Table 1 provides the parameter counts:

Model Qubit count/params Patch size # Trainable Params
VQ-KAN (GAVQKAN) ψjk\psi_{jk}7 ψjk\psi_{jk}8 256 + spline knots
QGAN ψjk\psi_{jk}9 ϕk\phi_k0 48
CNN ϕk\phi_k1 ϕk\phi_k2

Measured on MNIST and CIFAR-10, GAVQKAN achieves competitive Sliced Wasserstein Distance and MSE to the reference data using at least an order of magnitude fewer parameters than the classical baseline, and a factor of 5 less than standard CNNs of similar output size (Wakaura, 11 Dec 2025). Training time is 6.5×–11.5× longer than a shallow CNN or QGAN, a consequence of parameter-shift evaluations on quantum hardware/simulators.

5. Experimental Results and Metrics

Evaluations on MNIST (16×16) and CIFAR-10 (downsampled to 22×22) reveal:

  • For early training (<400 iterations), GAVQKAN generator loss rises rapidly, then stabilizes; discriminator loss plateaus more gradually, indicating more stable adversarial convergence than CNN or QGAN.
  • In Sliced Wasserstein Distance (SWD), GAVQKAN attains the lowest SWD in early epochs for both datasets.
  • Generated images become recognizable by iteration ≈100.
  • GAVQKAN maintains accuracy and sample diversity in low-data regimes (1000 samples), unlike classical NNs that require larger data and parameter counts.
  • A single GAVQKAN generator block of 256 params achieves similar sample realism to much larger classical discriminators.

6. Expressive Power and Theoretical Insights

KANs theoretically approximate any multivariate function using ϕk\phi_k3 outer sums rather than ϕk\phi_k4 parameters of a fully connected network. The quantum extension further boosts expressivity: Born distributions measured from quantum states mix each input's influence non-linearly across all ϕk\phi_k5 outcomes, while parametrized spline-based Ry rotations realize highly flexible, nonlinear, high-fidelity feature transformations. GAVQKAN leverages this to compete with classical benchmarks in quality-to-parameter ratio, and the patch-by-patch synthesis allows efficient generation of long output vectors without parameter scaling proportional to data size.

7. Relationship to Prior Quantum-Adversarial and KAN Models

GAVQKAN extends both quantum GAN and KAN frameworks. It differs from conventional quantum GANs (Zoufal et al., 2019, Dallaire-Demers et al., 2018) by the functional role of its generator: KAN-based decomposition is encoded in the variational circuit, allowing for univariate spline-parameterized modules, rather than generic rotation layers. This achieves both parameter efficiency and nonlinear expressivity. Unlike QGANs with simple rotation/entangler ansätze, GAVQKAN implements the full Kolmogorov–Arnold functional mechanism (Wakaura, 11 Dec 2025). Compared to prior hybrid adversarial schemes (Shu et al., 2024, Al-Othni et al., 13 Jul 2025), it takes advantage of KAN's favorable expressivity scaling and quantum evaluation's Born-sampling bottleneck, producing higher accuracy and diversity under data-scarce regimes.


Key References:

  • Wakaura, et al. "Generative Adversarial Variational Quantum Kolmogorov-Arnold Network" (Wakaura, 11 Dec 2025)
  • Zoufal, et al. "Quantum Generative Adversarial Networks for Learning and Loading Random Distributions" (Zoufal et al., 2019)
  • Shu, et al. "Variational Quantum Circuits Enhanced Generative Adversarial Network" (Shu et al., 2024)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Generative Adversarial Variational Quantum KAN.