Observation-adaptive agents for the “use-when-useful” sensing regime
Characterize and develop reinforcement learning agents that operate in the “use-when-useful” sensing regime, where high-dimensional observations are available but may be selectively ignored, by formalizing when an agent should rely primarily on internal state and open-loop structure and when it should incorporate exteroceptive sensory inputs (e.g., dense flow-field estimates) during both training and execution.
References
To our knowledge, this 'use-when-useful' regime remains largely unexplored.
— Using reinforcement learning to probe the role of feedback in skill acquisition
(2512.08463 - Terpin et al., 9 Dec 2025) in Section 4.1, Privileged information and observation-adaptive agents