AQuA -- Combining Experts' and Non-Experts' Views To Assess Deliberation Quality in Online Discussions Using LLMs
Abstract: Measuring the quality of contributions in political online discussions is crucial in deliberation research and computer science. Research has identified various indicators to assess online discussion quality, and with deep learning advancements, automating these measures has become feasible. While some studies focus on analyzing specific quality indicators, a comprehensive quality score incorporating various deliberative aspects is often preferred. In this work, we introduce AQuA, an additive score that calculates a unified deliberative quality score from multiple indices for each discussion post. Unlike other singular scores, AQuA preserves information on the deliberative aspects present in comments, enhancing model transparency. We develop adapter models for 20 deliberative indices, and calculate correlation coefficients between experts' annotations and the perceived deliberativeness by non-experts to weigh the individual indices into a single deliberative score. We demonstrate that the AQuA score can be computed easily from pre-trained adapters and aligns well with annotations on other datasets that have not be seen during training. The analysis of experts' vs. non-experts' annotations confirms theoretical findings in the social science literature.
- Text-based emotion detection: Advances, challenges, and opportunities. Engineering Reports, 2(7):e12189.
- Disentangling diversity in deliberative democracy: Competing theories, their blind spots and complementarities. Journal of Political Philosophy, 18(1):32–63.
- Measuring deliberation 2.0: standards, discourse types, and sequenzialization. In ECPR General Conference, pages 5–12. Potsdam.
- Nick Beauchamp. 2020. 321Modeling and Measuring Deliberation Online. In The Oxford Handbook of Networked Communication. Oxford University Press.
- Annotating social acts: authority claims and alignment moves in wikipedia talk pages. In Proceedings of the Workshop on Languages in Social Media, LSM ’11, page 48–57, USA. Association for Computational Linguistics.
- Laura W Black. 2008. Listening to the city: Difference, identity, and storytelling in online deliberative groups. Journal of Deliberative Democracy, 5(1).
- 83Discourse Quality Index. In Research Methods in Deliberative Democracy. Oxford University Press.
- German’s next language model. In Proceedings of the 28th International Conference on Computational Linguistics, pages 6788–6796, Barcelona, Spain (Online). International Committee on Computational Linguistics.
- Gina Masullo Chen. 2017. Online incivility and public debate: Nasty talk. Springer.
- Online and Uncivil? Patterns and Determinants of Incivility in Newspaper Website Comments. Journal of Communication, 64(4):658–679.
- BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota. Association for Computational Linguistics.
- Nicholas Diakopoulos. 2015. Picking the nyt picks: Editorial criteria and automation in the curation of online news comments. ISOJ Journal, 5(1):147–166.
- John S Dryzek. 2002. Deliberative democracy and beyond: Liberals, critics, contestations. Oxford University Press, USA.
- Different arenas, different deliberative quality? using a systemic framework to evaluate online deliberation on immigration policy in germany. Policy & Internet, 13(1):86–112.
- Neele Falk and Gabriella Lapesa. 2023a. Bridging argument quality and deliberative quality annotations with adapters. In Findings of the Association for Computational Linguistics: EACL 2023, pages 2469–2488, Dubrovnik, Croatia. Association for Computational Linguistics.
- Neele Falk and Gabriella Lapesa. 2023b. StoryARG: a corpus of narratives and personal experiences in argumentative texts. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2350–2372, Toronto, Canada. Association for Computational Linguistics.
- Eleonore Fournier-Tombs and Giovanna Di Marzo Serugendo. 2020. DelibAnalysis: Understanding the quality of online political discourse with machine learning. Journal of Information Science, 46(6):810–822.
- Dennis Friess and Christiane Eilders. 2015. A systematic review of online deliberation research. Policy & Internet, 7(3):319–339.
- Collective civic moderation for deliberation? exploring the links between citizens’ organized engagement in comment sections and the deliberative quality of online discussions. Political Communication, 38(5):624–646.
- Frauke Gerlach and Christiane Eilders, editors. 2022. #meinfernsehen 2021. Nomos, Baden-Baden.
- Visual linguistic analysis of political discussions: Measuring deliberative quality. Digital Scholarship in the Humanities, 32(1):141–158.
- Todd Graham. 2010. The use of expressives in online political talk: Impeding or facilitating the normative goals of deliberation? In Electronic Participation, pages 26–41, Berlin, Heidelberg. Springer Berlin Heidelberg.
- Effects of empowerment moderation in online discussions: A field experiment with four news outlets. In 72nd Annual Conference of the International Communication Association (ICA).
- Parameter-efficient transfer learning for NLP. In Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, pages 2790–2799. PMLR.
- The sfu opinion and comments corpus: A corpus for the analysis of online news comments. Corpus Pragmatics, 4:155–190.
- John Lawrence and Chris Reed. 2020. Argument Mining: A Survey. Computational Linguistics, 45(4):765–818.
- Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
- QualityAdapt: an automatic dialogue quality estimation framework. In Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 83–90, Edinburgh, UK. Association for Computational Linguistics.
- Facebook FAIR’s WMT19 news translation task submission. In Proceedings of the Fourth Conference on Machine Translation (Volume 2: Shared Task Papers, Day 1), pages 314–319, Florence, Italy. Association for Computational Linguistics.
- Zizi Papacharissi. 2004. Democracy online: civility, politeness, and the democratic potential of online political discussion groups. New Media & Society, 6(2):259–283.
- AdapterFusion: Non-destructive task composition for transfer learning. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 487–503, Online. Association for Computational Linguistics.
- AdapterHub: A framework for adapting transformers. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 46–54, Online. Association for Computational Linguistics.
- Adapters: A unified library for parameter-efficient and modular transfer learning.
- Learning multiple visual domains with residual adapters. In Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc.
- Mary F Scudder. 2022. Measuring democratic listening: A listening quality index. Political research quarterly, 75(1):175–187.
- Bokyong Shin and Mikko Rask. 2021. Assessment of online deliberative quality: New indicators using network analysis and time-series analysis. Sustainability, 13(3).
- Measuring political deliberation: A discourse quality index. Comparative European Politics, 1:21–48.
- Yla R. Tausczik and James W. Pennebaker. 2010. The psychological meaning of words: Liwc and computerized text analysis methods. Journal of Language and Social Psychology, 29(1):24–54.
- Attention is all you need. In Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc.
- Bildungsbezogene Biases in crowd-annotierten Daten zur automatischen Klassifikation von konstruktiven und inzivilen Kommentaren (Educational biases in crowd-annotated data for the automatic classification of constructive and incivil comments). In Annual Conference of the Political Communication Devision of the German Association of Communication Science (DGPuK).
- A decline in the quality of debate? the evolution of cognitive complexity in swiss parliamentary debates on immigration (1968–2014). Swiss Political Science Review, 21(4):636–653.
- Linking news value theory with online deliberation: How news factors and illustration factors in news articles affect the deliberative quality of user discussions in sns’ comment sections. Communication Research, 47(6):860–890.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.