Papers
Topics
Authors
Recent
Search
2000 character limit reached

MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs

Published 19 Aug 2024 in cs.MA | (2408.09955v3)

Abstract: LLM-based multi-agent systems (MAS) have shown promise in tackling complex tasks. However, existing solutions often suffer from limited agent coordination and heavy reliance on predefined Standard Operating Procedures (SOPs), which demand extensive human input. To address these limitations, we propose MegaAgent, a large-scale autonomous LLM-based multi-agent system. MegaAgent generates agents based on task complexity and enables dynamic task decomposition, parallel execution, efficient communication, and comprehensive system monitoring of agents. In evaluations, MegaAgent demonstrates exceptional performance, successfully developing a Gobang game within 800 seconds and scaling up to 590 agents in a national policy simulation to generate multi-domain policies. It significantly outperforms existing systems, such as MetaGPT, in both task completion efficiency and scalability. By eliminating the need for predefined SOPs, MegaAgent demonstrates exceptional scalability and autonomy, setting a foundation for advancing true autonomy in MAS. Our code is available at https://github.com/Xtra-Computing/MegaAgent .

Citations (3)

Summary

  • The paper demonstrates a novel hierarchical framework that dynamically splits tasks for autonomous agent generation and coordination in LLM-based systems.
  • It utilizes multi-level task splitting, parallel execution, and integrated modules (action, storage, monitor) to optimize performance, with empirical validation via Gobang and policy simulation experiments.
  • The framework significantly enhances scalability and efficiency by enabling dynamic, concurrent operations without relying on predefined SOPs, paving the way for future agent systems.

MegaAgent: A Large-Scale Autonomous LLM-Based Multi-Agent System

Introduction

MegaAgent is an innovative framework designed to address the limitations of predefined Standard Operating Procedures (SOPs) and scalability in LLM-powered multi-agent systems (LLM-MA systems). Traditional systems often lack autonomy and rely on fixed protocols, leading to inefficiencies in complex task environments. MegaAgent introduces a hierarchical structure to dynamically generate agents based on task requirements, enabling effective cooperation and concurrent operations without predefined SOPs. This approach enhances scalability, autonomy, and performance, paving the way for future developments in large-scale autonomous systems.

Framework Structure

MegaAgent comprises several key components: Multi-Level Task Splitting, Action Module, Storage Module, Monitor Mechanism, and Communication Mechanism.

  1. Multi-Level Task Splitting: MegaAgent starts by dividing a given task into multiple hierarchical sub-tasks, each assigned to a group of agents. A boss agent orchestrates task assignment, ensuring that tasks are properly distributed among sub-task admins who oversee individual agent groups. This hierarchical design facilitates task management and scalability by distributing complexity across different levels, allowing for efficient handling of large-scale interactions.
  2. Action Module: This component is responsible for executing sub-tasks. Agents within a group create checklists to manage their actions, while an OS agent monitors outputs to ensure they meet predefined formats. The Action Module supports parallelism, allowing multiple agent groups to operate simultaneously, significantly reducing task completion time compared to sequential systems.
  3. Storage Module: It facilitates communication between agents and external files, integrating Git management, memory databases for long-term storage, and a task monitor to ensure task progress and organization. This module helps maintain data consistency and provides a comprehensive framework for collaborative agent operations.
  4. Monitor Mechanism: This employs an OS agent and checklists to minimize errors and ensure that agents follow expected workflows. Each agent tracks its progress through checklists, which are regularly updated to ensure alignment with overarching goals.
  5. Communication Mechanism: Agents communicate within defined hierarchies, interacting with direct superiors or subordinates. This structured format supports effective collaboration within and across agent groups, enhancing overall communication efficiency. Figure 1

    Figure 1: MegaAgent starts by accepting prompts, which it automatically divides into tasks for multiple agent groups. OS Agent monitors the agents' progress, while the action module interacts with the storage module through a data retriever.

Experimental Evaluation

MegaAgent was evaluated through two experiments: Gobang game development and national policy simulation.

  1. Gobang Game Development: MegaAgent demonstrated superior autonomy and efficiency, completing the task within 800 seconds with a naive AI, outperforming other LLM-MA systems like AutoGen, MetaGPT, CAMEL, and AgentVerse. The framework successfully coordinated task execution across seven agents, leveraging parallelism to optimize performance. Figure 2

    Figure 2: Interface of MegaAgent's Gobang Demo

  2. National Policy Simulation: MegaAgent showcased its scalability by managing approximately 590 agents to generate detailed policies across various sectors within 3000 seconds. The framework's hierarchical structure facilitated efficient policy development, highlighting its capability to handle complex, large-scale scenarios. Figure 3

    Figure 3: An example of the generated hierarchy for the policy making experiment

Implications and Future Directions

MegaAgent establishes a foundation for autonomous cooperation in LLM-MA systems, serving as a potential operating system for future large-scale agent frameworks. It addresses the critical need for scalable infrastructure and dynamic task management in complex environments. Future research may explore optimizing communication efficiency, integrating diverse LLMs for cost-effective operations, and further reducing hallucinations by leveraging external knowledge bases.

Conclusion

MegaAgent presents a practical solution for managing large-scale LLM-MA systems, emphasizing autonomy, scalability, and efficient cooperation. Through innovative hierarchical structuring and parallel execution capabilities, it surpasses existing frameworks in task completion efficacy. The framework lays the groundwork for future exploration in autonomous multi-agent systems, drawing attention to enhancing effective agent cooperation in increasingly complex scenarios.

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Glossary

  • Admin agent: An agent responsible for overseeing a subtask and managing agents within its group. "The admin agent then generates a group of agents to complete the task."
  • AlphaBeta algorithm: A search algorithm that prunes branches in minimax to optimize decision-making in games. "the current ChatGPT API is not capable of developing the AlphaBeta algorithm without errors by itself."
  • Boss agent: The top-level agent that decomposes a task into subtasks and assigns them to admins. "a boss agent receives the prompt, divides the task into sub-tasks, and assigns each sub-task to a corresponding sub-task admin."
  • Centralized communication: A coordination paradigm where a central agent governs interactions among other agents. "Centralized communication involves a central agent or a group of central agents coordinating the system's communication, with other agents primarily connecting to the central agent."
  • Checklist: A structured list maintained by agents to track and verify their actions. "each agent maintains a checklist to track its actions."
  • Cosine similarity metric: A measure of similarity between two vectors based on the cosine of the angle between them. "so that the agent can fetch the most similar message into its memory by cosine similarity metric."
  • Data retriever: A component that mediates function calls between agent actions and the storage module. "we introduce a data retriever as an interface between action module and storage module."
  • Decentralized communication: A peer-to-peer coordination paradigm without a central controller. "Decentralized communication operates on a peer-to-peer basis among agents."
  • Embeddings: Vector representations of text or data used for retrieval and similarity search. "The output of each round is encoded using LLMs into embeddings to be stored in the vector database"
  • Global mutex lock: A mutual exclusion mechanism used to serialize operations across the system. "All Git operations are serialized using a global mutex lock."
  • HEAD (Git): The reference to the latest commit in a Git repository. "merges them to the latest HEAD"
  • Hierarchical mechanism: A layered organizational structure for task decomposition and management. "we introduce a hierarchical mechanism: a boss agent receives the prompt, divides the task into sub-tasks"
  • Layered communication: Communication organized by hierarchical levels, with interactions across adjacent layers. "Layered communication is organized hierarchically, with agents at each level having distinct roles and each layer interacting with adjacent layers"
  • LLM-MA systems: LLM-powered multi-agent systems for cooperative task solving. "LLM-powered multi-agent systems (LLM-MA systems) have been proposed to tackle real-world tasks."
  • Message pool: A shared channel where agents publish and subscribe to messages to coordinate. "A shared message pool, as proposed in MetaGPT \cite{hong2023metagpt}, maintains a shared message pool where agents publish and subscribe to relevant messages, boosting communication efficiency."
  • Meta-programming: A programming approach where code generates or manipulates other code or workflows. "MetaGPT introduces a meta-programming framework that effectively simulates the software development process"
  • OS agent: A supervisory agent that enforces output formats and monitors agent-group progress. "The OS agent is employed to monitor the agent groups' output to ensure they chat in the right format;"
  • Publish and subscribe: A messaging pattern where producers publish messages and consumers subscribe to relevant topics. "agents publish and subscribe to relevant messages"
  • Sequential pipeline: A linear execution model where steps proceed one after another without parallelism. "these systems utilize a sequential pipeline without considering parallel execution of agents"
  • Standard Operating Procedures (SOPs): Predefined instructions dictating how agents should respond to inputs or situations. "their agents mostly follow predefined Standard Operating Procedures (SOPs) that remain unchanged across the whole interaction"
  • System-level parallelism: Parallel execution across multiple components or agent groups to improve throughput. "employs system-level parallelism to enhance performance and boost communication."
  • Task monitor: A component that verifies outputs, reports to supervisors, and triggers rework when needed. "The task monitor is designed to verify the existence of output files, report their content to the supervising agent, and, if necessary, assign rework to the subordinate agent."
  • Temperature (parameter): A control parameter in LLMs that adjusts randomness in sampling. "setting the `temperature' parameter to $0$ to ensure more deterministic responses"
  • Vector database: A database optimized for storing and retrieving embeddings using similarity search. "to be stored in the vector database"
  • World simulations: Complex scenario simulations using many LLM agents to emulate real-world dynamics. "simulate more complex scenarios, known as world simulations"

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.