Benchmarking LLM Polarization Risk
Benchmark the polarization risk associated with AI-generated political content produced by frontier large language models by systematically measuring whether and to what extent such content amplifies political polarization.
References
Besides the limitations we discuss above, several extensions remain open. First, beyond persuasion risk, it is also concerning that AI-generated political content may amplify polarization \citep{goldstein2023generative, hackenburg2025comparing}. Benchmarking against LLM polarization risk would therefore be consequential.
— Benchmarking Political Persuasion Risks Across Frontier Large Language Models
(2603.09884 - Chen et al., 10 Mar 2026) in Conclusion