Cognitive-Flexible Deep Stochastic Model
- The paper demonstrates how integrating deep stochastic latent representation with a bounded cognitive flexibility mechanism achieves safe and rapid online adaptation.
- It employs a surprise-driven online update mechanism within a Bayesian MPC framework to maintain probabilistic safety under abrupt system and observation shifts.
- Empirical evaluations show CF–DeepSSSM outperforms nominal and robust MPC, ensuring optimal safety, comfort, and bounded parameter drift in nonstationary environments.
The Cognitive-Flexible Deep Stochastic State-Space Model (CF–DeepSSSM) is a framework for online adaptation and safety-certified control of partially observed, nonstationary systems. It combines deep stochastic latent-state modeling, surprise-regulated reorganization of inference mappings (cognitive flexibility), and a Bayesian Model Predictive Control (BMPC) architecture with adaptive constraint tightening. The approach is motivated by the challenge of learning-enabled control under abrupt changes in system dynamics or sensing, where fixed-representation models often fail to preserve safety and performance in the presence of distributional shift (Nuchkrua et al., 31 Jan 2026).
1. Stochastic State-Space Modeling and Latent Representation
CF–DeepSSSM models partially observed systems with dynamics and observations: where (true state), (input), (observation), and (unknown disturbances).
A deep stochastic state-space model for the latent belief is constructed via: where is a variational inference network parameterized by . The latent transition and observation dynamics are defined as: 0 with 1 and 2.
The control objective is formulated in belief space, minimizing expected stage costs under chance constraints: 3 for 4, enforcing safety (Nuchkrua et al., 31 Jan 2026).
2. Cognitive Flexibility Index and Bounded Reorganization
To moderate adaptation of the inference mechanism, the Cognitive Flexibility Index (CFI) is defined to regulate the reorganization of the latent belief mapping 5. The CFI constraint enforces bounded change in inference parameters: 6 Operationally, updates to 7 are limited such that 8. This prevents uncontrolled drift of internal representations during adaptation, providing a mechanism for safe and localized model reorganization (Nuchkrua et al., 31 Jan 2026).
3. Online Surprise-Driven Adaptation Mechanism
Online adaptation in CF–DeepSSSM is driven by predictive surprise, which quantifies the negative log-likelihood of an observation under the current model: 9 At each time step, the following procedure is executed:
- Inference: 0, with 1 the observation and input history.
- Control: Solve the BMPC using the current belief.
- Action & Observation: Apply 2, record 3.
- Surprise Calculation: Compute 4.
- Parameter Update: Perform a stochastic gradient step for the generative parameters,
5
where the step-size 6 is chosen to satisfy 7.
This surprise-driven adaptation enables rapid recovery and bounded reorganization in response to abrupt model mismatches, while ensuring that safety is not compromised (Nuchkrua et al., 31 Jan 2026).
4. Embedding in Bayesian Model Predictive Control
CF–DeepSSSM integrates its adaptive latent state model within a finite-horizon Bayesian Model Predictive Control (BMPC) framework: 8
9
To account for epistemic uncertainty in the model, an adaptive constraint tightening procedure replaces each nominal constraint 0 with: 1 where 2 is the predictive mean and 3 is the state covariance. Under Lipschitz-continuity, this guarantees probabilistic satisfaction of the original constraints. The first input is applied; the problem is re-solved at the next step, ensuring recursive feasibility and probabilistic safety at all times (Nuchkrua et al., 31 Jan 2026).
5. Theoretical Guarantees
CF–DeepSSSM provides several central theoretical guarantees under standard (regularity, Lipschitz, bounded noise, compactness) assumptions:
- Bounded Posterior Drift (Theorem 1): For parameter perturbations 4 and step-size 5,
6
- Recursive Feasibility (Theorem 2): Adaptive constraint tightening ensures that feasibility at time 7 implies feasibility at 8.
- Closed-Loop Input-to-State Stability (Theorem 3): The closed-loop belief dynamics are ISS with respect to bounded modeling error.
- Probabilistic Safety Preservation (Corollary): All 9 pairs satisfy 0 with prescribed violation probabilities at every 1.
- Dominant Tightening (Lemma 1): If 2 is 3-Lipschitz and 4, enforcing 5 yields
6
These results provide strong assurances for stability and safety in nonstationary belief dynamics under abrupt or gradual changes (Nuchkrua et al., 31 Jan 2026).
6. Empirical Evaluation and Comparative Analysis
Simulation experiments are conducted on a two-dimensional partially observed system subject to state and input constraints (7, 8). The stage cost penalizes tracking error on 9 and regulates 0 to zero.
Two main scenarios are considered:
- Abrupt Dynamics Shift (1):
- The true system dynamics 2 switch regime.
- CF–DeepSSSM exhibits a spike in predictive surprise (3), a temporally localized rise in 4, followed by rapid convergence.
- State/input constraints remain satisfied throughout.
- Compared to nominal MPC, which violates safety (5), and robust MPC, which achieves perfect safety at higher cost, CF–DeepSSSM maintains perfect safety (6), with optimal comfort cost (7) and bounded mean CFI (8).
| Controller | SafetyRate | ComfortCost | Mean CFI |
|---|---|---|---|
| Nominal MPC | 0.87 | 0.92 | 0.05 |
| Robust MPC | 1.00 | 1.18 | 0.04 |
| CF–DeepSSSM | 1.00 | 0.78 | 0.17 |
- Observation Drift (9):
- System matrices 0 constant; 1 (observation mapping) smoothly drifts (sensor miscalibration).
- CF–DeepSSSM adapts the observation mapping via bounded cognitive flexibility, maintaining high tracking performance and continuous safety (Nuchkrua et al., 31 Jan 2026).
7. Synthesis and Significance
CF–DeepSSSM synthesizes deep stochastic modeling, surprise-constrained latent space reorganization, and rigorous Bayesian MPC for safe online adaptation. Its key attributes include:
- Dynamic adaptation of the inference mapping under formal CFI bounds.
- Explicit embedding in a probabilistically safe BMPC, with adaptive tightening reflecting epistemic uncertainty.
- Theoretical guarantees of bounded parameter drift, recursive feasibility, closed-loop ISS, and probabilistic safety.
- Superior empirical performance compared to nominal and robust MPC under system or sensing shift scenarios.
The methodology demonstrates that learning-enabled, as opposed to learning-based, control architectures can offer both safety and rapid adaptation in nonstationary, partially observed cyber-physical systems (Nuchkrua et al., 31 Jan 2026).