- The paper introduces a multi-agent framework that automates each stage of the ML pipeline, enhancing efficiency and accessibility.
- It leverages a retrieval-augmented planning strategy to decompose tasks into specialized sub-tasks, ensuring precise and efficient execution.
- Experimental validation demonstrates a 100% success rate in deployable model generation under constraint-free scenarios and superior performance in constraint-aware settings.
AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML
The paper "AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML" (2410.02958) introduces a novel multi-agent framework designed to automate the entire process of machine learning model development, from data retrieval to model deployment, using LLMs. This framework, termed AutoML-Agent, aims to enhance the accessibility and efficiency of automated machine learning (AutoML) by allowing users to interact with a natural language interface.
Framework Overview
AutoML-Agent is structured around several specialized agents that collaborate to fulfill user instructions. The framework consists of the following key components:
- Agent Manager (Amgr): Acts as the core interface with users, devises plans, assigns tasks to agents, and verifies execution results.
- Prompt Agent (Ap): Parses user instructions into a standardized format to streamline communication between agents.
- Data Agent (Ad): Handles data-related tasks, including retrieval, preprocessing, and analysis.
- Model Agent (Am): Focuses on model search, hyperparameter optimization, and candidate ranking.
- Operation Agent (Ao): Implements the final solution by producing deployment-ready code.
Figure 1: Overview of our AutoML-Agent framework. (1) Initialization stage aims to receive a valid user instruction using request verification. (2) Planning stage focuses on extracting ML-related information by parsing the user instruction into a standardized form, and uses it to devise plans accordingly. (3) Execution stage executes each action given by the devised plans. Finally, based on the best execution results, AutoML-Agent outputs codes containing deployable models to the user.
Retrieval-Augmented Planning and Execution
To tackle the complexities of full-pipeline AutoML, AutoML-Agent introduces a retrieval-augmented planning (RAP) strategy. This strategy generates multiple plans based on past knowledge and current research, enabling parallel exploration of different solutions. Each plan is decomposed into sub-tasks tailored to the roles of the agents, allowing for efficient and focused execution without the need for further model training.
The multi-stage verification system ensures the reliability of the model by checking the validity of user requests, execution outcomes, and implementation results. This iterative feedback mechanism helps refine the generated solutions and adapt them to the user's requirements.
Figure 2: Performance comparison across all datasets using the SR, NPS, and CS metrics under (a) constraint-free and (b) constraint-aware settings. Higher scores indicate better results.
Experimental Validation
The AutoML-Agent framework was evaluated across a variety of datasets and machine learning tasks, demonstrating superior performance compared to existing methods such as GPT-3.5, GPT-4, and DS-Agent. In particular, AutoML-Agent achieved a 100% success rate in generating deployable models under constraint-free scenarios and consistently outperformed baselines in constraint-aware settings.

Figure 3: Results of (a) ablation and (b) hyperparameter studies in the CS metric.
Implications and Future Work
AutoML-Agent represents a significant advancement in making the power of AutoML accessible beyond expert practitioners, enabling efficient development of AI models even for users with limited technical knowledge. Its framework demonstrates the potential to further democratize AI development by simplifying complex processes.
Future research may focus on extending the framework's capabilities to support more diverse types of machine learning tasks, such as reinforcement learning and recommendation systems, as well as improving its robustness to operate independently of specific LLM backbones. Addressing these areas will expand the applicability and effectiveness of AutoML-Agent in complex real-world environments.
Conclusion
AutoML-Agent offers a comprehensive and efficient solution for automating the full pipeline of machine learning development, leveraging the capabilities of LLMs through a multi-agent framework. By addressing planning complexity and implementation accuracy, it sets a new standard for accessible AI-driven solutions. As the framework evolves, it is poised to further simplify the creation of sophisticated machine learning models, enhancing the potential for widespread AI innovation.