
Layered Modal ML: Stratified Type Systems

Updated 10 February 2026
  • Layered Modal ML is a framework that integrates modal type theory with stratified machine learning, enabling safe code manipulation and staged computation.
  • It enforces strict separation of computational layers using modal operators and box/unbox constructs to achieve type soundness and semantic transparency.
  • LMML principles extend to practical applications like hierarchical classification and neural network fusion, enhancing performance in multimodal systems.

Layered Modal ML (LMML) refers to a principled synthesis of modal type theory and stratified, multi-layered machine learning and metaprogramming systems. LMML provides a formal framework for managing distinct computational strata—such as computation versus code manipulation, or perceptual layers versus symbolic reasoning—via modal operators, type stratification, and controlled effect boundaries. It supports intensional operations (pattern matching on code), typing and semantic soundness across layers, and informs both the design of metaprogramming languages and modern multimodal architectures.

1. Theoretical Foundations and Modal Stratification

Layered Modal ML was initially formalized in the context of modal type theories with explicit staging, most notably in simply-typed and dependently-typed systems. The core idea is to partition terms, types, and their operations into discrete strata, typically representing distinct computational modalities. For example, the foundational simply-typed LMML presented by Hu and Pientka fixes two layers: layer 0 for "code" and layer 1 for "normal computation" (Hu et al., 2023). Types and terms are governed by a typing index $i \in \{0,1\}$ and modal constructs:

  • The type constructor $\Box_1 A$ signifies "code of type $A$," only available at layer 1.
  • Terms like $\mathrm{box}_1\{e\}$ (quotation) and $\mathrm{unbox}_1\,e$ (splicing) are used to construct and destruct code values, respectively.
  • Staging discipline is enforced: layer 0 forms no box modalities, and $\Box_1$ appears only at $i = 1$.
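
The two-layer discipline can be illustrated with a minimal, hypothetical Python sketch (this is an illustration of the quote/splice idea, not LMML itself): a `Code` wrapper plays the role of a layer-1 value of type $\Box_1 A$, and a staged power function generates layer-0 code at layer 1 before anything is run.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Code:
    """A layer-1 value standing for Box_1 A: a quoted layer-0 expression."""
    src: str

def box(src: str) -> Code:      # plays the role of box_1{e} (quotation)
    return Code(src)

def splice(c: Code) -> str:     # plays the role of unbox_1 e (splicing)
    return c.src

def power_code(n: int, x: Code) -> Code:
    """Staged power: at layer 1, generate layer-0 code computing x**n."""
    if n == 0:
        return box("1")
    return box(f"({splice(x)} * {splice(power_code(n - 1, x))})")

# Generate code for x**3 at layer 1, then run it at layer 0 (here: Python eval).
cube = power_code(3, box("x"))
print(cube.src)                   # (x * (x * (x * 1)))
print(eval(cube.src, {"x": 2}))   # 8
```

The point of the stratification is visible even in this toy: `power_code` only ever builds and splices quoted code; the generated program is executed separately, so code manipulation never leaks into the computation it describes.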

Advancing to dependent types, DeLaM further stratifies into static-code, dynamic-executable, and meta-programming (tactic) layers, all parameterized over MLTT universes and supporting full recursion on the syntax of both types and terms (Hu et al., 2024).

2. Syntax, Typing, and Operational Semantics

Syntax

LMML typically extends a simply-typed $\lambda$-calculus or MLTT:

  • Types: $A, B ::= \alpha \mid \mathbb{N} \mid A \rightarrow B \mid \Box_1 A$
  • Terms: $e, f ::= x \mid \mathrm{zero} \mid \mathrm{succ}\;e \mid \lambda x.e \mid e\,f \mid \mathrm{box}_1\{e\} \mid \mathrm{unbox}_1\,e \mid \mathrm{match}\;e\;\mathrm{with}\;\{\ldots\}$

Contexts distinguish between global (code) and local (runtime) variables.

Typing

Typing judgments are indexed by the layer, with the core judgment form $\Psi; \Gamma \vdash_i e : A$. Key modal typing rules include:

  • (Box Intro): $\Psi; \varnothing \vdash_0 e : A \implies \Psi; \Gamma \vdash_1 \mathrm{box}_1\{e\} : \Box_1 A$
  • (Box Elim): $\Psi; \Gamma \vdash_1 e : \Box_1 A$ and $\Psi, u{:}A; \Gamma \vdash_1 f : T \implies \Psi; \Gamma \vdash_1 \mathrm{unbox}_1\,e\;\mathrm{in}\;u \Rightarrow f : T$
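
The two rules above can be rendered as a small bidirectional checker. The following is a minimal sketch (names and representation are my own, not the paper's): the judgment $\Psi; \Gamma \vdash_i e : A$ becomes `check(psi, gamma, i, e)`, with Box Intro re-typing the boxed body at layer 0 under an empty local context, and Box Elim extending the global context $\Psi$.

```python
from dataclasses import dataclass

# Types
@dataclass(frozen=True)
class Nat: pass
@dataclass(frozen=True)
class Arrow: dom: object; cod: object
@dataclass(frozen=True)
class Box: body: object            # Box_1 A

# Terms
@dataclass(frozen=True)
class Var: name: str               # local (runtime) variable from Gamma
@dataclass(frozen=True)
class GVar: name: str              # global (code) variable u from Psi
@dataclass(frozen=True)
class Zero: pass
@dataclass(frozen=True)
class Lam: param: str; ty: object; body: object
@dataclass(frozen=True)
class App: fn: object; arg: object
@dataclass(frozen=True)
class BoxI: body: object           # box_1{e}
@dataclass(frozen=True)
class Unbox: code: object; u: str; body: object  # unbox_1 e in u => f

def check(psi, gamma, i, e):
    """Psi; Gamma |-_i e : A.  psi: global ctx, gamma: local ctx, i in {0,1}."""
    if isinstance(e, Zero): return Nat()
    if isinstance(e, Var): return gamma[e.name]
    if isinstance(e, GVar): return psi[e.name]
    if isinstance(e, Lam):
        return Arrow(e.ty, check(psi, {**gamma, e.param: e.ty}, i, e.body))
    if isinstance(e, App):
        f = check(psi, gamma, i, e.fn)
        assert isinstance(f, Arrow) and check(psi, gamma, i, e.arg) == f.dom
        return f.cod
    if isinstance(e, BoxI):
        assert i == 1, "box_1 is only available at layer 1"
        # Box Intro: body typed at layer 0 with an EMPTY local context.
        return Box(check(psi, {}, 0, e.body))
    if isinstance(e, Unbox):
        assert i == 1, "unbox_1 is only available at layer 1"
        c = check(psi, gamma, 1, e.code)
        assert isinstance(c, Box)
        # Box Elim: u : A is added to the global context Psi.
        return check({**psi, e.u: c.body}, gamma, 1, e.body)
    raise TypeError(e)

# unbox_1 (box_1{zero}) in u => u  has type Nat at layer 1.
prog = Unbox(BoxI(Zero()), "u", GVar("u"))
print(check({}, {}, 1, prog))      # Nat()
```

Note how the stratification shows up directly in the code: `BoxI` and `Unbox` refuse to typecheck at layer 0, which is exactly the "layer 0 forms no box modalities" constraint.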

The operational semantics is call-by-value, extending standard evaluation with rules for box/unbox, pattern matching on code, and context subsumption (lifting pure computations).

Layering discipline ensures:

  • Strict stratification of modal constructs
  • Total pattern matching on code syntax
  • Absence of ad-hoc well-foundedness checks

This enforces that code-intensional features remain encapsulated and never leak into pure computation (Hu et al., 2023).
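
The call-by-value treatment of box/unbox can be sketched as a small substitution-based evaluator (again a hypothetical illustration, with names of my choosing): boxed code is a value, and `unbox ... in u => f` splices the quoted body in for `u` before continuing.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Var: name: str
@dataclass(frozen=True)
class Lam: param: str; body: object
@dataclass(frozen=True)
class App: fn: object; arg: object
@dataclass(frozen=True)
class BoxI: body: object                         # box_1{e}: a value
@dataclass(frozen=True)
class Unbox: code: object; u: str; body: object  # unbox_1 e in u => f

def subst(e, name, v):
    """Substitution for this closed-program sketch (no capture handling needed)."""
    if isinstance(e, Var):
        return v if e.name == name else e
    if isinstance(e, Lam):
        return e if e.param == name else Lam(e.param, subst(e.body, name, v))
    if isinstance(e, App):
        return App(subst(e.fn, name, v), subst(e.arg, name, v))
    if isinstance(e, BoxI):
        return BoxI(subst(e.body, name, v))
    if isinstance(e, Unbox):
        return Unbox(subst(e.code, name, v), e.u,
                     e.body if e.u == name else subst(e.body, name, v))

def eval_(e):
    if isinstance(e, (Lam, BoxI)):
        return e                                  # values: lambdas and boxed code
    if isinstance(e, App):
        f, a = eval_(e.fn), eval_(e.arg)          # call-by-value
        return eval_(subst(f.body, f.param, a))
    if isinstance(e, Unbox):
        c = eval_(e.code)                         # must evaluate to box_1{e'}
        return eval_(subst(e.body, e.u, c.body))  # splice e' for u, then run
    raise ValueError(e)

# unbox_1 (box_1{\x.x}) in u => u  evaluates to the identity function \x.x.
print(eval_(Unbox(BoxI(Lam("x", Var("x"))), "u", Var("u"))))
```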

3. Semantic Models, Soundness, and Full Abstraction

Layered Modal ML admits precise semantic models ensuring type soundness, normalization, and, crucially, full abstraction.

  • Game semantics are used to model contextual equivalence in LMML, capturing the interaction dynamics between program fragments and their environments (Yin et al., 3 Feb 2026).
  • Full abstraction is achieved: for closed LMML terms $M_1, M_2$, contextual equivalence coincides with trace equivalence in the game semantics, i.e.,

$$M_1 \approx M_2 \iff \llbracket M_1 \rrbracket = \llbracket M_2 \rrbracket$$

  • CIU (Closed Instantiations of Uses) theorem: contextual equivalence coincides with equivalence under all closing substitutions and evaluation contexts, reflecting the staging discipline for variable instantiation.

This semantic clarity supports program optimizations, verification of meta-program transformations, and the guarantee that open code can be safely stored and run under higher-order references (Yin et al., 3 Feb 2026).

4. LMML in Multimodal and Hierarchical ML Systems

Layered Modal ML concepts structure recent advances in multimodal machine learning, particularly in the design of fusion and gating mechanisms, as well as hierarchical classification.

  • In multimodal hierarchical classification, LMML principles guide the design of transitional classifiers: separate classifier heads for each hierarchy level, with gating enforcing parent-child consistency via a taxonomy matrix, yielding increased consistency and leaf-level accuracy (Chen et al., 12 Jan 2025).
  • DeepMLF implements LMML in sentiment analysis by employing a set of learnable fusion tokens and multimodal cross-attention modules distributed across deep LM layers. The tokens enforce "dedicated multimodal capacity" and the depth of stacked fusion modules (5–7 layers) is a critical performance factor (Georgiou et al., 15 Apr 2025).
  • Layer selection and staged fusion, inspired by LMML stratification, now guide state-of-the-art practices in visual feature fusion for multimodal LLMs. Empirical studies show selecting and externally fusing a single shallow, middle, and deep visual layer (e.g., layers 3, 18, 23 for CLIPViT-L/24) yields superior, generalizable results, while redundant features from the same stage degrade performance (Lin et al., 8 Mar 2025, Chen et al., 30 Apr 2025).
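
As one hedged illustration of parent-child consistency gating with a taxonomy matrix (the cited paper's exact mechanism may differ; the matrix form and renormalization here are assumptions), a child distribution can be masked by the probability mass its parent level assigns, then renormalized:

```python
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def gate_children(parent_logits, child_logits, T):
    """T[c][p] = 1 iff child class c has parent class p.

    The child distribution is weighted by the probability its parent
    received, then renormalized, enforcing parent-child consistency.
    """
    parent_probs = softmax(parent_logits)
    child_probs = softmax(child_logits)
    gated = [cp * sum(t * pp for t, pp in zip(row, parent_probs))
             for cp, row in zip(child_probs, T)]
    z = sum(gated)
    return [g / z for g in gated]

# Two parents; children 0 and 1 belong to parent 0, child 2 to parent 1.
T = [[1, 0], [1, 0], [0, 1]]
# The raw child head prefers child 2, but the parent head strongly
# prefers parent 0, so gating shifts mass to parent-0's children.
probs = gate_children([3.0, 0.0], [0.0, 0.0, 2.0], T)
print(max(range(3), key=probs.__getitem__))   # 0
```

The toy run shows the consistency effect the bullet describes: an ungated child head would predict class 2, but gating by the parent distribution moves the prediction to a child of the dominant parent.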

5. Extensions: Dependent Types, Meta-Level Programming, and Tactics

Recent extensions to LMML include full dependent types and meta-programming capabilities:

  • DeLaM scales LMML to dependent type theory. It supports safe staging and pattern matching on both terms and types, meta-level introspection, and tactic programming as meta-programs at the meta-layer. This enables a uniform substrate for programs and proof tactics, ensuring soundness, normalization, and decidable conversion (Hu et al., 2024).
  • Intensional code analysis (e.g., "recursion on code") is formalized at the meta-layer, with application to tactics such as "destruct" for recursion over inductive types.
  • The system supports universe polymorphism, full recursion on code, and safe staging across static (syntax-only), dynamic (executable), and meta layers.

This rigorous layering yields highly modular, verifiable foundations for metaprogramming languages and proof assistants.

6. Robustness, Practical Applications, and Design Implications

Layered Modal ML offers several practical and methodological benefits:

  • Separation of concerns: Intensional (code-manipulating) and extensional (run-time) computations are strictly separated.
  • Total pattern matching: All code constructs are covered, eliminating "stuck" states due to partial matches.
  • Direct semantic correspondence: The Curry–Howard perspective on staging and code manipulators ensures that staged computation and meta-programming have transparent, compositional models.
  • Generalization to neural systems: Layering and modal gating concepts inform neural architectures via depth-wise fusion in transformers, hierarchical fusion in multimodal LLMs, and dynamic sparsification strategies (Chen et al., 12 Jan 2025, Georgiou et al., 15 Apr 2025, Lin et al., 8 Mar 2025).
  • Normalization and decidability: Stratified modal systems support normalization by evaluation (NbE) and decide type convertibility at each layer, supporting verification and strong software engineering guarantees (Hu et al., 2023, Hu et al., 2024).
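
The NbE idea mentioned in the last bullet (evaluate into a semantic domain, then read back a normal form) can be sketched for a plain untyped $\lambda$-calculus fragment; this is a generic NbE sketch, not the layered algorithm from the cited papers:

```python
from dataclasses import dataclass

# Syntax
@dataclass(frozen=True)
class Var: name: str
@dataclass(frozen=True)
class Lam: param: str; body: object
@dataclass(frozen=True)
class App: fn: object; arg: object

# Semantic domain: Python closures plus neutral (stuck) terms.
@dataclass
class Clo: fn: object            # a Python function Value -> Value
@dataclass
class NVar: name: str
@dataclass
class NApp: fn: object; arg: object

def eval_(t, env):
    """Evaluate syntax into the semantic domain."""
    if isinstance(t, Var):
        return env[t.name]
    if isinstance(t, Lam):
        return Clo(lambda v, t=t, env=env: eval_(t.body, {**env, t.param: v}))
    if isinstance(t, App):
        f, a = eval_(t.fn, env), eval_(t.arg, env)
        return f.fn(a) if isinstance(f, Clo) else NApp(f, a)

def reify(v, n=0):
    """Read a semantic value back into (beta-normal) syntax."""
    if isinstance(v, Clo):
        x = f"x{n}"                       # fresh variable for this binder depth
        return Lam(x, reify(v.fn(NVar(x)), n + 1))
    if isinstance(v, NVar):
        return Var(v.name)
    if isinstance(v, NApp):
        return App(reify(v.fn, n), reify(v.arg, n))

def normalize(t):
    return reify(eval_(t, {}))

# (\x.x) (\y.y) normalizes to the identity (up to renaming of bound variables).
print(normalize(App(Lam("x", Var("x")), Lam("y", Var("y")))))
```

In the stratified setting, the cited works run an algorithm of this shape per layer, which is what makes type convertibility decidable layer by layer.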

The LMML discipline is now integral to statically safe metaprogramming, robust hierarchical classification, and practical deep neural architectures for multimodal learning.


Citations:

(Hu et al., 2023, Hu et al., 2024, Chen et al., 12 Jan 2025, Georgiou et al., 15 Apr 2025, Lin et al., 8 Mar 2025, Yin et al., 3 Feb 2026, Chen et al., 30 Apr 2025)
