Credit Assignment for Long-Horizon Agentic Reasoning
Develop principled and generalizable credit-assignment algorithms for long-horizon large language model-based agentic systems that integrate token-level decisions, external tool invocations, skill selection, and memory operations, and enable learning that transfers across extended sequences of episodes and tasks.
References
A core open problem is how to assign credit across tokens, tool calls, skills, and memory updates, and to generalize such learning across a long sequence of episodes and tasks.
— Agentic Reasoning for Large Language Models
(2601.12538 - Wei et al., 18 Jan 2026) in Section 7.2