Learned Split-Spectrum Metalens
- The paper demonstrates that a learned split-spectrum metalens achieves a 32.29% PSNR gain over hyperbolic designs by rejecting near-depth occluders via spectral splitting.
- It employs a differentiable optimization framework that integrates PSF and image simulators to fine-tune meta-atom phase designs for diffraction-limited performance.
- Neural network postprocessing further enhances imaging quality, improving detection mAP and segmentation IoU, making it ideal for compact, mobile applications.
A learned split-spectrum metalens is a meta-optical system that enables broadband, obstruction-free imaging in the visible spectrum by employing a learned meta-atom phase pattern combined with spectral filtering and post-capture neural-network enhancement. This approach physically rejects signals from near-depth occlusions—such as raindrops, fences, or dust—that would otherwise degrade image quality, all within a compact, single-element design optimized for space-constrained applications.
1. Optical Principles: Depth–Wavelength Symmetry and Spectrum Splitting
A fundamental limitation in diffractive metalenses is the “depth–wavelength symmetry,” wherein a change in either the object depth () or the wavelength () produces equivalent point-spread function (PSF) shifts. Mathematically, the symmetry is given by , with the wavelength–depth relation under the paraxial approximation:
where is object distance, is focal length, and is the design wavelength. A corresponding change in wavelength that mimics a depth shift is:
This symmetry complicates the simultaneous realization of broadband and obstruction-free imaging: near-depth occluders remain in focus across the spectrum. The split-spectrum approach addresses this by dividing each RGB channel into “pass” () and “stop” () spectral bands via a multi-band filter. The metalens is optimized such that for scene depths beyond and , the PSF is diffraction-limited. For near-occluders at , the symmetry ensures focused wavelengths shift into , which are blocked. The overall transfer function for channel , at each spatial location , is thus
where is the amplitude mask (1 in pass band, 0 in stop band), and is the camera’s spectral sensitivity.
2. Meta-Atom Design and Fabrication
The meta-atom building block is fabricated from silicon-nitride (SiNₓ) with a thickness and a unit-cell period . Each unit cell comprises an in-plane bar with width and length . The cell functions as a Pancharatnam–Berry meta-atom: the in-plane orientation angle encodes a local geometric phase . This design enables “geometric phase” control with high efficiency, documented at 78.4%, 72.9%, 66.9% for the center wavelengths of the blue, green, and red pass bands (, , ), respectively, with an average conversion efficiency of over .
Lens fabrication uses high-speed electron-beam lithography (EBL), followed by chromium evaporation as an etch mask, dry etching into SiNₓ, and mask lift-off. The completed metalens features a focal length and aperture . Three lenses were produced for comparison: the learned split-spectrum metalens, a learned broadband metalens (without spectrum split), and a conventional hyperbolic metalens ().
3. Differentiable Optimization and End-to-End Learning
The metalens design leverages a differentiable, end-to-end learning framework to optimize the local orientation map . Two modules are constructed:
- A PSF simulator generating .
- An image simulator generating , where is a clean far-scene DIV2K patch and is a simulated occlusion.
The loss function is:
with quantifying the error to the ground-truth image and enforcing high-Strehl PSFs at over . Importance sampling over proportional to speeds spectral integration during training. Optimization uses Adam on batches of clean/obstruction pairs.
4. Neural Network Postprocessing
Imaging fidelity is further improved through neural network postprocessing with the “LocalNet” (Kim et al., CVPR 2024) architecture: a U-Net–style encoder–decoder with skip connections, positional encoding, and per-channel spectral weighting. LocalNet is trained using metalens-captured, obstruction-obscured DIV2K image patches as input, paired with clean ground-truth images captured through an compound lens. Loss consists of or error, supplemented with perceptual loss (VGG) terms. The positional encoding of enables correction of spatially varying aberrations. Neural enhancement is performed post-hoc without joint optimization with .
5. Experimental Validation and Quantitative Performance
Performance is assessed through PSF imaging (430–645 nm in 5 nm steps) at and , and through imaging of printed 2D targets using a color CMOS sensor with standard Bayer RGB filter and the custom multi-band dichroic filter. Raw RGB images are processed through the pretrained LocalNet.
Quantitative results include:
| Metric | Hyperbolic Metalens | Broadband Learned | Learned Split-Spectrum |
|---|---|---|---|
| PSNR (dB) | 15.84 | 18.79 | 20.94 |
| UAV Detection mAP | 0.0350 | 0.0292 | 0.1704 |
| Kvasir-SEG IoU | 0.3472 | 0.5950 | 0.8317 |
| Cityscapes mIoU | 0.4666 | 0.4601 | 0.6701 |
The split-spectrum design achieves a PSNR gain of 32.29% over the hyperbolic baseline and 11.45% over broadband learned. For vision tasks under obstruction: detection mAP is improved by 13.54% (absolute) over hyperbolic design, IoU for polyp segmentation by 48.45%, and mIoU for semantic segmentation by 20.35%.
6. Advantages, Limitations, and Future Perspectives
The learned split-spectrum metalens provides a single-shot, zero-moving-parts optical solution to near-depth obstruction, within a fully flat, wafer-scale meta-optical form factor (), suitable for mobile robots, drones, and endoscopes. Unobstructed imaging quality also surpasses conventional hyperbolic devices, with a PSNR gain of 23.9 %.
Limitations include reliance on an external multi-band filter; tighter integration of filtering and meta-atom amplitude engineering could improve throughput and size. The design is static for a fixed ; extension to variable-depth or dynamic occluders may require dynamic filtering or event-based approaches. Expanding to other spectral bands (NIR, SWIR) or hyperspectral splitting may further enhance depth discrimination and obstruction removal.
By analytically exploiting and then breaking the depth–wavelength symmetry through learned spectral filtering and meta-atom phase design, the learned split-spectrum metalens demonstrates obstruction rejection and broadband imaging in a single, compact optical element, with observed gains in PSNR and performance on real-world vision tasks (Yoon et al., 27 Jan 2026).