Generalizability of ToM-like Emergence Across Model Architectures
Determine whether the emergence of Theory-of-Mind-like behavior observed in memory-equipped large language model poker agents generalizes to language model architectures beyond Anthropic Claude.
References
While cross-model validation of ToM coding with GPT-4o yielded high agreement ($\kappa = 0.81$), the generalizability of ToM-like behavior emergence to other model architectures remains an open question.
— Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents
(2604.04157 - Lin et al., 5 Apr 2026) in Discussion, Limitations subsection