Can Language Models Recognize Convincing Arguments?
Abstract: The capabilities of LLMs have raised concerns about their potential to create and propagate convincing narratives. Here, we study their performance in detecting convincing arguments to gain insights into LLMs' persuasive capabilities without directly engaging in experimentation with humans. We extend a dataset by Durmus and Cardie (2018) with debates, votes, and user traits and propose tasks measuring LLMs' ability to (1) distinguish between strong and weak arguments, (2) predict stances based on beliefs and demographic characteristics, and (3) determine the appeal of an argument to an individual based on their traits. We show that LLMs perform on par with humans in these tasks and that combining predictions from different LLMs yields significant performance gains, surpassing human performance. The data and code released with this paper contribute to the crucial effort of continuously evaluating and monitoring LLMs' capabilities and potential impact. (https://go.epfl.ch/persuasion-LLM)
- MEGA: Multilingual evaluation of generative AI. In Houda Bouamor, Juan Pino, and Kalika Bali (eds.), Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pp. 4232–4267, Singapore, December 2023. Association for Computational Linguistics. doi: 10.18653/v1/2023.emnlp-main.258. URL https://aclanthology.org/2023.emnlp-main.258.
- Exploiting personal characteristics of debaters for predicting persuasiveness. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7067–7072, 2020.
- Artificial intelligence can persuade humans on political issues, February 2023.
- On the opportunities and risks of foundation models. arXiv preprint arXiv:2108.07258, 2021.
- Weaponized health communication: Twitter bots and russian trolls amplify the vaccine debate. American journal of public health, 108(10):1378–1384, 2018.
- Truth, lies, and automation. Center for Security and Emerging Technology, 1(1):2, 2021.
- Five years of argument mining: A data-driven analysis. In IJCAI, volume 18, pp. 5427–5433, 2018.
- The American Voter. Wiley, 1960.
- Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD ’16, pp. 785–794, New York, NY, USA, 2016. Association for Computing Machinery. ISBN 9781450342322. doi: 10.1145/2939672.2939785. URL https://doi.org/10.1145/2939672.2939785.
- The small effects of political advertising are small regardless of context, message, sender, or receiver: Evidence from 59 real-time randomized experiments. Science advances, 6(36):eabc4046, 2020.
- Chatgpt and the rise of large language models: the new ai-driven infodemic threat in public health. Frontiers in Public Health, 11:1166120, 2023.
- The tactics & tropes of the internet research agency. 2019.
- Exploring the role of prior beliefs for argument persuasion. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2018.
- American public opinion: Its origins, content, and impact. Routledge, 2019.
- Generative language models and automated influence operations: Emerging threats and potential mitigations. arXiv preprint arXiv:2301.04246, 2023.
- Does counter-attitudinal information cause backlash? results from three large survey experiments. British Journal of Political Science, 50(4):1497–1515, 2020.
- Personalized persuasion: Tailoring persuasive appeals to recipients’ personality traits. Psychological science, 23(6):578–581, 2012.
- Quantifying the persona effect in llm simulations. arXiv preprint arXiv:2402.10811, 2024.
- Logical self-defense. Idea, 2006.
- Argument mining: A survey. Computational Linguistics, 45(4):765–818, 2020.
- Overview of imagearg-2023: The first shared task in multimodal argument mining. In Proceedings of the 10th Workshop on Argument Mining, pp. 120–132, 2023.
- Psychological targeting as an effective approach to digital mass persuasion. Proceedings of the national academy of sciences, 114(48):12714–12719, 2017.
- Rachel Minkin. Diversity, equity and inclusion in the workplace: A survey report, 2023.
- Daniel J O’Keefe. Persuasion: Theory and research. Sage Publications, 2015.
- Gender, age, and responsiveness to cialdini’s persuasion strategies. In Persuasive Technology: 10th International Conference, PERSUASIVE 2015, Chicago, IL, USA, June 3-5, 2015, Proceedings 10, pp. 147–159. Springer, 2015.
- Argument quality and persuasive effects: A review of current approaches. In Argumentation and values: Proceedings of the ninth Alta conference on argumentation, pp. 88–92, 1995.
- On the conversational persuasiveness of large language models: A randomized controlled trial. arXiv preprint arXiv:2403.14380, 2024.
- Wisdom of the silicon crowd: Llm ensemble prediction capabilities match human crowd accuracy. arXiv preprint arXiv:2402.19379, 2024.
- The persuasive effects of political microtargeting in the age of generative artificial intelligence. PNAS Nexus, 3(2):pgae035, January 2024. ISSN 2752-6542. doi: 10.1093/pnasnexus/pgae035.
- Evaluating the social impact of generative ai systems in systems and society. arXiv preprint arXiv:2306.05949, 2023.
- Beyond memorization: Violating privacy via inference with large language models, 2023.
- Winning arguments: Interaction dynamics and persuasion strategies in good-faith online discussions. In Proceedings of the 25th international conference on world wide web, pp. 613–624, 2016.
- Quantifying the potential persuasive returns to political microtargeting. Proceedings of the National Academy of Sciences, 120(25):e2216261120, 2023.
- A review and conceptual framework for understanding personalized matching effects in persuasion. Journal of Consumer Psychology, 31(2):382–414, 2021.
- Computational argumentation quality assessment in natural language. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pp. 176–187, 2017.
- Douglas Walton. Fundamentals of critical argumentation. Cambridge University Press, 2005.
- Sociotechnical safety evaluation of generative ai systems. arXiv preprint arXiv:2310.11986, 2023.
- The generative AI paradox: What it can create, it may not understand”. In The Twelfth International Conference on Learning Representations, 2023.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.