Learning Adaptive and Interpretable Multi-Agent Collaboration Policies
Develop learning algorithms that produce adaptive, interpretable collaboration policies for large language model-based multi-agent systems and demonstrate robustness under partial observability and adversarial conditions.
References
A key open problem is how to learn adaptive, interpretable collaboration policies that remain robust under partial observability and adversarial conditions.
— Agentic Reasoning for Large Language Models
(2601.12538 - Wei et al., 18 Jan 2026) in Section 7.4