
AIOS: LLM Agent Operating System

Published 25 Mar 2024 in cs.OS, cs.AI, and cs.CL | arXiv:2403.16971v5

Abstract: LLM-based intelligent agents face significant deployment challenges, particularly related to resource management. Allowing unrestricted access to LLM or tool resources can lead to inefficient or even potentially harmful resource allocation and utilization for agents. Furthermore, the absence of proper scheduling and resource management mechanisms in current agent designs hinders concurrent processing and limits overall system efficiency. To address these challenges, this paper proposes the architecture of AIOS (LLM-based AI Agent Operating System) under the context of managing LLM-based agents. It introduces a novel architecture for serving LLM-based agents by isolating resources and LLM-specific services from agent applications into an AIOS kernel. This AIOS kernel provides fundamental services (e.g., scheduling, context management, memory management, storage management, access control) for runtime agents. To enhance usability, AIOS also includes an AIOS SDK, a comprehensive suite of APIs designed for utilizing functionalities provided by the AIOS kernel. Experimental results demonstrate that using AIOS can achieve up to 2.1x faster execution for serving agents built by various agent frameworks. The source code is available at https://github.com/agiresearch/AIOS.


Summary

  • The paper introduces a novel AIOS that isolates LLM-specific services from traditional OS functions, enhancing resource allocation.
  • It employs classic scheduling algorithms, advanced context management, and an adaptive K-LRU memory policy to manage execution efficiently.
  • Empirical results demonstrate up to 2.1× faster execution and improved throughput, underscoring the system’s scalability for LLM agents.


Introduction

The paper "AIOS: LLM Agent Operating System" introduces the architecture of an LLM-based AI Agent Operating System (AIOS), specifically targeting the resource management challenges associated with deploying LLM-based intelligent agents. The primary objective of AIOS is to isolate resources and LLM-specific services from agent applications into an AIOS kernel, thereby improving resource allocation efficiency and enabling proper scheduling for concurrent processing. This operating system is designed to serve LLM-based agents by providing core functionalities such as scheduling, context management, memory management, storage management, and access control. AIOS also includes an SDK with a comprehensive API suite that streamlines the use of AIOS kernel functionalities (Figure 1).

Figure 1: A motivating example of how an agent (i.e., a travel agent) requires both LLM-related and non-LLM-related (i.e., OS) services to complete a task, where red denotes LLM-related services and blue denotes non-LLM-related services.

Architecture of AIOS

The architecture of AIOS is structured into three distinct layers: the application layer, the kernel layer, and the hardware layer.

  1. Application Layer: This layer facilitates the design and development of agent applications. It provides interfaces via the AIOS SDK to request system resources, thereby abstracting the complexities involved in direct resource manipulation and ensuring system isolation (Figure 2).

    Figure 2: An overview of the AIOS architecture of distinct layers. Application layer facilitates the design and development of agent applications. Kernel layer manages core functionalities and resources to serve agent applications. Hardware layer controls and manages physical computing resources and devices to support kernel layer functionalities.

  2. Kernel Layer: The kernel layer encompasses the traditional OS kernel for non-LLM tasks and the AIOS kernel that handles LLM-specific functionalities. Within the AIOS kernel, different modules manage agent queries by decomposing them into execution units that the scheduler orchestrates. This layer provides specialized modules for LLM processing, memory and storage management, and tool usage, with features like context management for handling interruptions efficiently.
  3. Hardware Layer: This controls and manages physical computing resources and devices to support the functionalities of the kernel layer. Although it is less of a focus in AIOS, its efficient management remains crucial for overall system performance.

Kernel Implementation

Scheduler and Context Manager

The scheduler in AIOS centralizes all requests and dispatches them to the appropriate modules; it handles system calls with classic algorithms such as FIFO and Round Robin (RR) to balance resource distribution (Figure 3).

Figure 3: How agent queries are decomposed into AIOS system calls and how AIOS system calls are dispatched and scheduled. We omit the access manager module here as the access-related system calls will not be dispatched by the scheduler.
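The FIFO and Round Robin dispatch described above can be sketched as follows. This is a minimal illustration, not the AIOS API: the `SystemCall` class and its abstract work-unit model are hypothetical simplifications.

```python
from collections import deque
from dataclasses import dataclass

@dataclass
class SystemCall:
    """A hypothetical execution unit decomposed from an agent query."""
    agent: str
    name: str
    remaining: int  # abstract units of work left

def fifo(calls):
    """Run each call to completion in arrival order; return completion order."""
    queue, order = deque(calls), []
    while queue:
        call = queue.popleft()
        call.remaining = 0
        order.append(call.name)
    return order

def round_robin(calls, quantum=1):
    """Give each call one quantum per turn; preempt and requeue unfinished calls."""
    queue, order = deque(calls), []
    while queue:
        call = queue.popleft()
        call.remaining -= quantum
        if call.remaining > 0:
            queue.append(call)  # preempted: back of the queue
        else:
            order.append(call.name)
    return order
```

Under RR, a short call (e.g., a quick memory read) finishes ahead of a long-running LLM generation that arrived earlier, which is the fairness property the scheduler exploits when serving many concurrent agents.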

The context manager supports task interruption and resumption via snapshot and restoration processes, managing long-running system calls by preserving intermediate states using text-based and logits-based methods (Figure 4).

Figure 4: Illustration of the logits-based context snapshot and restoration process. We use beam search algorithm where beam width is set to 1 as an example.
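The snapshot/restore cycle for an interruptible decoding loop can be sketched as below. Everything here is illustrative: `decode_step` is a toy deterministic stand-in for picking the argmax token from the model's logits (beam width 1, as in Figure 4), and the class names are not the AIOS interface.

```python
class ContextManager:
    """Minimal sketch of snapshotting partial decoding state (beam width 1)."""

    def __init__(self):
        self.snapshots = {}

    def snapshot(self, call_id, tokens):
        # Preserve the partial token sequence so decoding can resume later.
        self.snapshots[call_id] = list(tokens)

    def restore(self, call_id):
        return list(self.snapshots.pop(call_id))

def decode_step(tokens):
    # Toy "model": next token id is the current length (stand-in for argmax over logits).
    return len(tokens)

def generate(cm, call_id, tokens, budget, target_len):
    """Decode until target_len or until the time budget runs out; snapshot if interrupted."""
    for _ in range(budget):
        if len(tokens) >= target_len:
            return tokens, True          # finished within budget
        tokens.append(decode_step(tokens))
    cm.snapshot(call_id, tokens)         # interrupted: save intermediate state
    return tokens, False
```

The point of the logits-based approach is that an interrupted generation resumes from its saved intermediate state rather than re-decoding from the start, which is what makes preemptive scheduling of long LLM calls affordable.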

Memory and Storage Management

The memory manager handles runtime data storage and retrieval, ensuring efficient resource usage through a K-LRU eviction policy that swaps less recently accessed data to storage when necessary (Figure 5).

Figure 5: Illustration of the memory and storage managers and their relationship. An agent's memory item in its memory block is evicted to storage if memory usage exceeds the memory limit, which defaults to 80% of the memory block size. This threshold is configurable through the AIOS configuration.
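A K-LRU policy with the 80% threshold described above can be sketched as follows. This is an assumption-laden illustration: sizes are modeled as item counts, and the class and method names are hypothetical, not the AIOS memory manager's API. The victim is the key whose K-th most recent access is oldest, with keys accessed fewer than K times evicted first.

```python
class MemoryManager:
    """Sketch of K-LRU eviction with an 80% usage threshold (illustrative names)."""

    def __init__(self, capacity, k=2, threshold=0.8):
        self.k = k
        self.limit = int(capacity * threshold)  # e.g. 80% of the memory block
        self.clock = 0                          # logical access clock
        self.memory = {}                        # key -> value (memory block)
        self.storage = {}                       # key -> value (evicted to storage)
        self.history = {}                       # key -> last k access times

    def _touch(self, key):
        self.clock += 1
        hist = self.history.setdefault(key, [])
        hist.append(self.clock)
        if len(hist) > self.k:
            hist.pop(0)

    def write(self, key, value):
        self.memory[key] = value
        self._touch(key)
        while len(self.memory) > self.limit:
            self._evict()

    def read(self, key):
        if key in self.storage:                 # swap back in on access
            self.write(key, self.storage.pop(key))
        self._touch(key)
        return self.memory[key]

    def _evict(self):
        # K-distance: time of the k-th most recent access; 0 if fewer than k accesses.
        def k_distance(key):
            hist = self.history[key]
            return hist[0] if len(hist) == self.k else 0
        victim = min(self.memory, key=k_distance)
        self.storage[victim] = self.memory.pop(victim)  # evict to storage
```

Compared with plain LRU, tracking the last K accesses keeps frequently reused items (e.g., an agent's working context) resident even if a burst of one-off reads would otherwise push them out.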

The storage manager handles persistent data storage, supporting operations such as versioning and rollback to preserve data integrity as agents evolve their stored state.
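The versioning-and-rollback idea can be sketched minimally as below. The interface is hypothetical, chosen only to illustrate the behavior; AIOS's actual storage manager may expose different operations.

```python
class StorageManager:
    """Sketch of versioned persistent storage with rollback (illustrative interface)."""

    def __init__(self):
        self.versions = {}  # key -> list of values, oldest first

    def put(self, key, value):
        # Each write appends a new version instead of overwriting.
        self.versions.setdefault(key, []).append(value)

    def get(self, key):
        return self.versions[key][-1]  # latest version

    def rollback(self, key, steps=1):
        """Discard the latest `steps` versions and return the restored value."""
        history = self.versions[key]
        if steps >= len(history):
            raise ValueError("cannot roll back past the first version")
        del history[-steps:]
        return history[-1]
```

Keeping the full version history per key is what makes rollback a constant-time pointer move rather than a recovery procedure, at the cost of extra storage.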

Evaluation

AIOS demonstrates its effectiveness through extensive evaluations, achieving up to 2.1× faster execution when serving agents built with various frameworks. The empirical results consistently show AIOS maintaining agent performance on standard benchmarks while significantly improving execution efficiency and system throughput (Figure 6).

Figure 6: Overall execution time and average agent waiting time when agent number increases from 250 to 2000.

Conclusion

AIOS provides an innovative architecture for efficiently managing LLM-based agents by isolating resources and enhancing system functionalities through an AIOS kernel. It facilitates improved scalability, execution efficiency, and resource management for LLM-intensive applications. Future research could explore more advanced scheduling algorithms, optimization of context management, and safety enhancements to further leverage AIOS's potential in various real-world deployments.
