The Anatomy of a Personal Health Agent
Abstract: Health is a fundamental pillar of human wellness, and the rapid advancements in LLMs have driven the development of a new generation of health agents. However, the application of health agents to fulfill the diverse needs of individuals in daily non-clinical settings is underexplored. In this work, we aim to build a comprehensive personal health agent that is able to reason about multimodal data from everyday consumer wellness devices and common personal health records, and provide personalized health recommendations. To understand end-users' needs when interacting with such an assistant, we conducted an in-depth analysis of web search and health forum queries, alongside qualitative insights from users and health experts gathered through a user-centered design process. Based on these findings, we identified three major categories of consumer health needs, each of which is supported by a specialist sub-agent: (1) a data science agent that analyzes personal time-series wearable and health record data, (2) a health domain expert agent that integrates users' health and contextual data to generate accurate, personalized insights, and (3) a health coach agent that synthesizes data insights, guiding users using a specified psychological strategy and tracking users' progress. Furthermore, we propose and develop the Personal Health Agent (PHA), a multi-agent framework that enables dynamic, personalized interactions to address individual health needs. To evaluate each sub-agent and the multi-agent system, we conducted automated and human evaluations across 10 benchmark tasks, involving more than 7,000 annotations and 1,100 hours of effort from health experts and end-users. Our work represents the most comprehensive evaluation of a health agent to date and establishes a strong foundation towards the futuristic vision of a personal health agent accessible to everyone.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Explain it Like I'm 14
Explaining “The Anatomy of a Personal Health Agent” in Simple Terms
Overview
This paper imagines a smart helper for your health—a Personal Health Agent (PHA). It’s like a team of friendly experts in your phone that can look at your wearable data (like steps, sleep, heart rate), understand health facts, and coach you toward healthier habits. The goal is to show how such a helper could be built, what it should do, and how well it might work in everyday life. It’s a research idea, not a product you can download.
Key Questions the Paper Tries to Answer
To make this health helper truly useful, the researchers asked:
- What do people actually want help with when it comes to their health?
- What skills should a smart health helper have to answer those needs?
- How can we design it so different parts work together smoothly?
- Does the system give good, safe, and personalized advice?
- How does it perform when tested with real health data from wearables and lab tests?
How They Built and Tested It
The researchers used a “user-centered” approach, which means they started by learning what people need and then designed the system around those needs.
They looked at real questions people ask online (Google Search, Gemini, Fitbit forums), ran surveys with Fitbit users, and held a workshop with 14 experts. From this, they found four main types of requests:
- General health knowledge (facts and explanations),
- Personal data insights (what your own numbers mean),
- Wellness advice (how to improve),
- Personal symptoms (what might be going on with your body).
To handle these requests, they designed a team of three “sub-agents” that work together, like a sports team with different positions:
- Data Science Agent (DS): Think of this like a data detective. It analyzes your time-based numbers from wearables (steps, sleep, heart rate) and compares them to the general population. It answers questions such as “Has my running gotten faster?” or “Do I sleep more on active days?” It plans the right calculations and uses code to get accurate results.
- Time-series data = a diary of your body’s signals over time.
- Statistical tests = checks to see if a pattern is real or just random.
- Domain Expert Agent (DE): This is the health knowledge expert. It explains medical terms and puts your data in context. It can interpret lab results and daily signals (like HRV or blood pressure) and personalize answers based on your age, health history, and environment.
- Differential diagnosis = a careful list of possible reasons for symptoms, not a final medical answer.
- Health Coach Agent (HC): This is your motivator and guide. It helps you set goals, find barriers, and build action plans. It uses techniques like motivational interviewing, which means it asks supportive questions so you discover your own reasons to change, making plans that fit your life.
These three are coordinated by an “orchestrator”—like a team coach—that decides which agent should do what, combines their answers, and helps the conversation flow.
To test the system, the team used:
- Mixed evaluations: both automated checks and human reviews.
- Real-world data from a study called WEAR-ME, where about 1,165 participants (Fitbit/Pixel Watch users) consented to share wearable data, answered health surveys, and got lab tests (like cholesterol and metabolic panels). Everything followed research ethics and privacy rules.
They ran 10 different benchmark tasks and collected more than 7,000 ratings, spending about 1,120 hours with experts and end-users to judge the system’s performance.
Main Findings and Why They Matter
Here’s what they found in simple terms:
- The Data Science Agent improved the quality of analysis plans and made fewer coding mistakes than a standard LLM. In other words, it was better at turning a vague question like “Am I sleeping well?” into a clear plan, correctly running the analysis, and giving accurate numbers.
- The Domain Expert Agent showed strong medical knowledge and could better personalize explanations by using the person’s data (wearables + lab results + background).
- The Health Coach Agent provided more useful, motivating conversations, according to both everyday users and professional health coaches.
- The whole multi-agent system (PHA) worked well when everything was combined: data insights + expert knowledge + coaching. End-users and experts judged it as more helpful and consistent in multi-turn, open-ended conversations about health goals.
Why this matters: Most health apps either show raw numbers, offer general advice, or give basic facts. This system tries to connect all three—analyzing your data, explaining what it means, and turning it into practical steps—so guidance is personalized and more likely to help you change habits.
What This Could Mean for the Future
If developed safely and responsibly, a personal health agent could:
- Help people understand their wearable data in plain language and spot meaningful patterns.
- Offer advice tailored to their goals and lifestyle, not one-size-fits-all tips.
- Encourage long-term healthy habits through supportive coaching, not just information dumps.
- Support, not replace, healthcare professionals—pointing people to real medical care when needed.
Important notes:
- This is a research framework, not a commercial product, and it’s meant for everyday wellness support rather than medical diagnosis.
- The system is designed to complement human experts and guide people to clinical resources for serious issues.
- Privacy and consent were central in the data used for testing.
In short, the paper shows how a “team” of smart agents could make health guidance more personal, accurate, and motivating—bringing us closer to a helpful, everyday health companion that’s accessible to everyone.
Collections
Sign up for free to add this paper to one or more collections.