ZeoNet: Geometric Encoding for Zeolite Discovery
- ZeoNet representation is a deep learning paradigm that converts periodic zeolite structures into normalized 3D distance grids, capturing both atomic coordination and mesoscale pore architectures.
- A compact 3D CNN, built with a ResNet-style architecture, extracts a 512-dimensional fingerprint and achieves over 95% classification accuracy compared to traditional geometric filters.
- By filtering over 300,000 hypothetical structures and reducing false positives to 0.4%, ZeoNet streamlines the discovery of synthesizable zeolite frameworks for experimental validation.
The ZeoNet representation is a specialized geometric encoding and deep learning approach designed to distinguish experimentally realizable zeolite frameworks from large computational libraries of hypothetical candidates. Unlike atom-centric graphs or purely topological descriptors, ZeoNet transforms the periodic crystal lattice of a zeolite into a 3D volumetric "distance-to-surface" grid, which encodes both local coordination geometries and mesoscale pore architectures. A compact 3D CNN extracts a 512-dimensional fingerprint from this grid and enables classification with accuracy exceeding prior geometric filters and machine learning baselines. ZeoNet thus enables the systematic assessment of synthetic feasibility for silicate and aluminophosphate zeolite-like materials absent exhaustive physics-based criteria, effectively reducing the practical search space for new synthesizable frameworks (Liu et al., 24 Jan 2026).
1. Construction of the 3D Volumetric Distance Grid
A periodic zeolite unit cell is embedded in a Cartesian box whose side length is set to a crystallographic axis or minimum-image equivalent. A uniform grid of voxels is placed within this box, with spacing and grid point coordinates for . At each , the minimum Euclidean distance to any atom in the framework, , is calculated. Subtracting a probe radius (chosen to reflect typical structure-directing species) yields a signed distance field ; negative values indicate overlap with atomic spheres, while positive values indicate accessible pore volume at least away from the framework. The resulting field is clipped and normalized to via , where is typically .
This volumetric field enables the ZeoNet architecture to capture atomic-scale to mesoscale structural detail, including pore geometry, connectivity, local bond angles, and crystallographic symmetry, in a format compatible with convolutional processing.
2. Input Tensor Formats and Channel Variants
Standard ZeoNet implementations provide input as either a single-channel or two-channel tensor:
- Single-channel: All framework atoms within the unit cell contribute to one distance field, resulting in , with .
- Two-channel: Separates T-atoms (Si or P) and O-atoms, yielding ; channel 0 encodes the distance to the nearest T-atom, channel 1 to the nearest O-atom.
Each channel is mean-centered and scaled to unit variance across the training set; further data augmentation is not applied apart from random sampling among the four classes ("synthesizable as silicate," "as aluminophosphate," "as both," and "not synthesizable") using WeightedRandomSampler.
3. ZeoNet 3D Convolutional Network Architecture
ZeoNet's encoder is a small-scale 3D ResNet-style or simple convolutional-pooling stack. The network ingests the distance grid input (–$48$) and passes it through successive layers:
- Conv3D (), BatchNorm3D, ReLU, MaxPool3D ().
- Conv3D (), BatchNorm3D, ReLU, MaxPool3D ().
- Conv3D (), BatchNorm3D, ReLU, MaxPool3D ().
- Conv3D (), BatchNorm3D, ReLU, MaxPool3D ().
- Fully connected (Flatten Linear ), ReLU, Dropout ().
- Linear (), ReLU, Dropout ().
- Final classification head: Linear () Softmax.
All convolutions use ReLU activations, batch normalization, and weight decay (). The penultimate 512-dimensional vector is designated as the "ZeoNet embedding" and serves as a fixed-length structural fingerprint for visualization and downstream classification tasks.
| Layer Type | Output Shape | Comments |
|---|---|---|
| Conv3D + Pool | Initial feature extraction | |
| Conv3D + Pool | Intermediate depth | |
| Conv3D + Pool | Deeper receptive field | |
| Conv3D + Pool | Last conv layer | |
| Fully Connected | 1024, then 512 | Embedding extraction |
| Linear + Softmax | 4 | Four-class output |
4. Structural Features Encoded by ZeoNet
The distance-grid encoding integrates atomic coordination and pore-scale topology:
- Pore Geometry and Connectivity: Level sets map local pore diameters and channel connectivity; bottlenecks appear as narrow regions.
- Local Coordination: Near T–O bonds, the distance field exhibits conical troughs encoding bond lengths and T–O–T angles, allowing recognition of realistic tetrahedral distortions.
- Global Cell Metrics: The grid directly encodes crystallographic symmetry and cell aspect ratio, empirically correlated with zeolite family classification.
- Probe Radius Selection: reflects the scale of structure-directing and hydrothermal species, emphasizing functional pore environments.
This unified geometric-semantic encoding enables the network to learn empirical patterns that distinguish the roughly 200 known IZA (International Zeolite Association) topologies from the vastly larger pool of hypothetical PCOD (Predicted Crystallographic Online Database) structures.
5. Performance Metrics and Comparative Results
ZeoNet, as reported by Liu et al. (Liu et al., 24 Jan 2026), demonstrates substantive improvements over prior geometric filters and ML methods:
- IZA Four-Class Accuracy: 383 IZA structures (2014 release), only 18 misclassified—top-1 accuracy .
- Balanced Recall Across Classes: Si-only, P-only, and Si/P classes all exhibit recalls exceeding .
- False Positive Suppression: Among hypothetical PCOD structures, only are incorrectly classified as synthesizable, reducing the search space by over compared to baseline approaches.
- Baseline Comparison: Geometric filter methods achieve IZA recall of , while ZeoNet elevates it to and halves the false-positive rate.
- Misclassified Candidates: The small cohort () of false negatives likely represents promising structures for future experimental synthesis, given that their formation energies and bond metrics closely match known zeolites.
6. Implications, Applications, and Limitations
ZeoNet's probe-corrected volumetric grids and convolutional architecture yield a differentiable representation strongly correlated with empirical synthesizability. The approach does not rely on comprehensive physics-based criteria for framework formation, and its high selectivity suggests immediate utility for guiding experimental prioritization in large computational libraries. A plausible implication is that the ZeoNet embedding encodes heretofore unquantified combinations of local and global motifs influential in zeolite formation chemistry.
However, ZeoNet's predictions are contingent on the scope of training data and the particular probe radius used. The subset of hypothetical structures misclassified as "synthesizable" may indicate gaps in current chemical understanding or unrecognized experimental potential. The use of volumetric grids is tailored for periodic, crystalline frameworks and may require adaptation for disordered or non-porous materials.
Together, ZeoNet's representation and classifier provide a robust tool for structural informatics in zeolite science, streamlining the identification of frameworks suitable for synthetic exploration and advancing the methodological state of zeolite discovery (Liu et al., 24 Jan 2026).