Richelieu Framework Overview

Updated 26 September 2025

Richelieu Framework is a modular system for self-evolving agents that integrates strategic planning, negotiation, and self-play in multi-agent settings.
It decomposes high-level objectives into sub-goals using memory retrieval and reflection to refine strategies based on historical successes and failures.
The framework leverages argumentation theory with SAT-based reasoning to handle uncertainty in negotiations, enabling dynamic alliances and strategic deception.

The Richelieu Framework denotes a modular system for creating self-evolving agents in complex multi-agent settings, especially AI-driven Diplomacy scenarios, integrating capabilities from contemporary LLM-based agents and argumentation theory. Core principles include strategic planning informed by memory and reflection, goal-driven negotiation shaped by social reasoning, and continual adaptation through self-play. Richelieu agents operate without reliance on human demonstration data, and are structured to maximize long-term coordination and negotiation efficiency in highly combinatorial environments.

1. Strategic Planning via Memory and Reflection

Richelieu agents instantiate strategic planning by decomposing high-level objectives into executable sub-goals. This mechanism involves analysis of the current game state $s_t$ and inference of social beliefs $\varphi_t$ representing opponents’ intentions. Sub-goals $\chi_t$ are generated through:

$\chi_t \leftarrow SR(s_t, \varphi_t, \Upsilon)$

where $\Upsilon$ is a vector encoding the long-term victory target and $SR$ denotes the planner’s strategic reasoning module. Historical experiences, archived in a memory buffer $M$ , are retrieved for “reflective” optimization via a similarity metric $h$ , yielding an experience vector $\eta_t$ :

$\eta_t \leftarrow h(s_t, \chi_t, M)$

This reflection process aligns current sub-goals with precedents to reinforce strategies that previously succeeded and revise those that failed. Richelieu’s planning paradigm draws on related work (including Re2LLM and “Devil’s Advocate” reflective methods) but specifically emphasizes long-horizon reasoning, which marks a methodological shift from short-sighted tactical planning in prior systems (Guan et al., 2024).

Goal-driven negotiation is a principal feature. The agent’s negotiation policy first parses the “social state”—estimating other agents’ goals and credibility from their actions and communications. The social reasoning module evaluates the veracity of received messages $m_{t}^{(j, i)}$ using inferred sub-goals $\chi_t^j$ , state $s_t$ , and a credibility score $\gamma_j$ :

$\psi_t^j \leftarrow g(s_t, \chi_t^j, m_{t}^{(j, i)}, \varphi_t, \gamma_j, \xi_t)$

$\psi_t^j$ quantifies the trustworthiness of agent $j$ , integrating real-time observations and memory-retrieved negotiation experience $\xi_t$ . This evaluation governs strategic decisions such as alliance formation or deception, as illustrated by case studies where Richelieu as France proposes a ceasefire to Austria for long-term geopolitical leverage, or as Germany feigns alliance with England while securing a defensive posture elsewhere. The significances include dynamic adaptation of negotiation language based on continuously updated social beliefs and reflection-derived trust metrics (Guan et al., 2024).

3. Self-Evolution through Self-Play

The framework’s self-evolution mechanism is predicated on multi-agent self-play, wherein Richelieu instances control multiple powers in simulated Diplomacy matches. All experience from such games—map states, actions, negotiations—is stored in $M$ . Performance metrics in the paper indicate sustained improvement: win rate increases, and defeat rate decreases over training epochs, plateauing as policy mastery emerges.

This approach obviates the need for external human demonstration corpora, continuously refining planning and negotiation modules via iterative exposure to adversarial and cooperative situations. A plausible implication is that memory quality and diversity critically affect self-evolution rates, with increasing size enhancing adaptation but risking redundancy (Guan et al., 2024).

4. System Architecture and Implementation Workflow

The Richelieu Framework consists of the following modular components:

Component	Role	Key Inputs/Outputs
Social Reasoning Module	Infer intentions of other agents	$s_t$ , messages, yields $\varphi_t$
Planner with Reflection	Generate and refine sub-goals	$s_t$ , $\varphi_t$ , $M$ ; yields $\chi_t$ , $\eta_t$
Negotiator	Produce negotiation acts, assess opponent trust	$\chi_t$ , messages, $\psi_t^j$
Actor	Executes moves/negotiations	Refined sub-goals, updated state
Memory Management	Stores and retrieves experience	Archive of $s_t$ , actions, messages
Self-Evolution	Trains via multi-agent self-play	Experience, updated memory

Workflow proceeds as: initialize scenario $\rightarrow$ social reasoning $\rightarrow$ planning with reflection $\rightarrow$ negotiation $\rightarrow$ action selection $\rightarrow$ memory update $\rightarrow$ self-evolution.

Distinctively, Richelieu tightly couples long-term reflection and strategic reasoning to both move selection and dialogue generation, differing from modular or static approaches typical in prior LLM agents.

5. Integration with Argumentation Frameworks

The Richelieu Framework is augmented by Rich Incomplete Argumentation Frameworks (RIAFs), designed to model qualitative uncertainty in argumentation processes (Mailly, 2020). RIAFs are formalized as:

$\text{RIAF} = \langle A, A^?, R, R^?, \leftrightarrow^? \rangle$

with $A$ (certain arguments), $A^?$ (uncertain arguments), $R$ (certain attacks), $R^?$ (uncertain attacks), and $\leftrightarrow^?$ (symmetric uncertain conflict). For each uncertain conflict $(a, b) \in \leftrightarrow^?$ at least one directionality is picked in a completion.

The SAT-based computational approach for RIAFs encodes such uncertainties via clauses like $(r_{a, b} \vee r_{b, a})$ , adapting existing methodologies for computational reasoning without increased complexity compared to IAFs. This provides Richelieu agents with a formal basis for reasoning over incompletely specified or ambiguous argumentative structures in real-world negotiations.

6. Applications and Future Research

Applications include automated diplomatic simulation, business negotiation modeling, military or strategic intelligence scenarios, and any multi-agent system requiring adaptive planning and social coordination.

Suggested future directions are:

Extension to scenarios with fundamentally incomplete information, closely modeling nondeterminacy typical in human diplomacy.
Optimization of memory retrieval for enhanced experience utility, mitigating risk of redundancy with increased storage.
Deployment in robotics or multi-agent video games, exploiting the framework’s reflective, self-evolving properties.
Enhancement through multi-modal input processing and dynamic, real-time learning paradigms.

These trajectories indicate utility in both research and operational contexts where agents must self-improve, synthesize long-term memories, and negotiate within highly dynamic, uncertain social environments.

7. Significance and Theoretical Foundation

The Richelieu Framework synthesizes strategic planning, negotiation, reflection, and argumentation uncertainty into a unified agent architecture. Its theoretical underpinnings derive from both state-of-the-art LLM agent design and formal argumentation models. The harmonization of self-play-driven evolution and SAT-adaptable argumentation reasoning furnishes a platform for the deployment of autonomous agents in complex, multi-agent social processes, with scalability and expressivity documented in formal analyses. Adoption of RIAFs as the underlying argumentation structure directly expands the representational capacity and robustness of Richelieu, ensuring applicability in domains where incomplete, uncertain, or ambiguous information dominates the interaction landscape.

Markdown Report Issue Upgrade to Chat

References (2)

Richelieu: Self-Evolving LLM-Based Agents for AI Diplomacy (2024)

A Note on Rich Incomplete Argumentation Frameworks (2020)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Richelieu Framework.