Mechanism behind low chain-of-thought controllability
Identify and characterize the causal mechanisms in training and inference that lead contemporary reasoning models to exhibit low chain-of-thought controllability relative to output controllability.
References
However, the mechanism behind low controllability is not well understood.
— Reasoning Models Struggle to Control their Chains of Thought
(2603.05706 - Yueh-Han et al., 5 Mar 2026) in Abstract