Tailoring iterative decoding schemes to exploit soft information from raw nanopore reads

Develop iterative decoding schemes from coding theory that directly exploit soft information derived from raw nanopore sequencing signals—such as the per-sample state probability matrices produced by CTC- or EDHMM-based basecallers—within integrated basecaller–decoder pipelines like SynDe, and determine their performance and computational complexity for DNA data storage readout.

Background

The paper introduces SynDe, a syndrome-trellis–based, beam-search decoder that integrates error-correcting code constraints directly into nanopore basecalling, using soft information from neural network outputs over raw signals. This approach shows competitive-to-superior performance with substantially lower complexity than prior basecaller-integrated decoders.

While SynDe focuses on convolutional and marker codes, the authors note that broader families of iterative decoding methods from coding theory have not yet been adapted to leverage the rich soft information available in raw nanopore reads. Designing such decoders and integrating them with basecalling could extend error-correction capability and efficiency beyond the specific schemes evaluated here.

References

The task of tailoring popular iterative decoding schemes from coding theory to exploit the soft information in raw reads, as done here, is also an open problem that deserves attention.

SynDe: Syndrome-guided Decoding of Raw Nanopore Reads  (2604.01054 - Banerjee et al., 1 Apr 2026) in Discussion, Section 3