Does cached retrieval for verbal confidence constitute genuine introspection?
Determine whether the cached retrieval mechanism identified in Gemma 3 27B and Qwen 2.5 7B qualifies as introspection in a stronger sense under rigorous criteria for introspective awareness, or whether it merely reflects retrieval of precomputed internal signals. In this mechanism, confidence is computed during answer generation, cached at the post-answer-newline token, and later retrieved by the confidence-colon token for verbalization.
References
This is consistent with recent evidence suggesting that LLMs possess some degree of introspective awareness \citep{anthropic2025introspection}, though whether the retrieval process we characterize constitutes introspection in a stronger sense remains an open question.
— How do LLMs Compute Verbal Confidence
(2603.17839 - Kumaran et al., 18 Mar 2026) in Conclusion