Impact of user communication diversity on agent performance and success
Quantify how variation in user communication styles, linguistic backgrounds, and cultural norms, along dimensions such as formality, verbosity, and politeness norms, affects the performance and task success of large language model agents in task-oriented conversational settings.
References
For example, even in a simple retail assistance scenario, users might vary along dimensions such as formality, verbosity, and politeness norms---but it remains unclear how much this diversity meaningfully impacts agent performance and task success \citep{truong-etal-2025-persona}.
— Lost in Simulation: LLM-Simulated Users are Unreliable Proxies for Human Users in Agentic Evaluations
(2601.17087 - Seshadri et al., 23 Jan 2026) in Introduction