Transferability and harms of agent intergroup bias in real-world deployments
Determine how far the intergroup bias that LLM-powered agents exhibit in minimal-group allocation simulations transfers to real-world deployments, and characterize the specific harms it may cause in human-facing, high-stakes contexts. Both questions call for evaluating agents on richer tasks, over longer interaction horizons, and with domain-specific assessments.
References
The extent to which such bias transfers to real deployments, and what harms it may cause in human-facing, high-stakes contexts, remains to be established with richer tasks, longer horizons, and domain-specific evaluations.
— When Agents See Humans as the Outgroup: Belief-Dependent Bias in LLM-Powered Agents
(2601.00240 - Wang et al., 1 Jan 2026) in Limitations (Section)