A (More) Realistic Evaluation Setup for Generalisation of Community Models on Malicious Content Detection
Abstract: Community models for malicious content detection, which take into account the context from a social graph alongside the content itself, have shown remarkable performance on benchmark datasets. Yet, misinformation and hate speech continue to propagate on social media networks. This mismatch can be partially attributed to the limitations of current evaluation setups that neglect the rapid evolution of online content and the underlying social graph. In this paper, we propose a novel evaluation setup for model generalisation based on our few-shot subgraph sampling approach. This setup tests for generalisation through few labelled examples in local explorations of a larger graph, emulating more realistic application settings. We show this to be a challenging inductive setup, wherein strong performance on the training graph is not indicative of performance on unseen tasks, domains, or graph structures. Lastly, we show that graph meta-learners trained with our proposed few-shot subgraph sampling outperform standard community models in the inductive setup. We make our code publicly available.
- Information Evolution in Social Networks. In Proceedings of the Ninth ACM International Conference on Web Search and Data Mining, pages 473–482.
- Evaluating Fake News Detection Models from Explainable Machine Learning Perspectives. In ICC 2021 - IEEE International Conference on Communications, pages 1–6.
- A Review on Language Models as Knowledge Bases.
- Hunt Allcott and Matthew Gentzkow. 2017. Social media and fake news in the 2016 election. Journal of Economic Perspectives, 31(2):211–36.
- How to train your MAML.
- Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks. In Proceedings of the 28th International Conference on Computational Linguistics, pages 5108–5123, Barcelona, Spain (Online). International Committee on Computational Linguistics.
- Lia Bozarth and Ceren Budak. 2020. Toward a Better Performance Evaluation Framework for Fake News Classification. Proceedings of the International AAAI Conference on Web and Social Media, 14:60–71.
- How Attentive are Graph Attention Networks?
- Graph-based Modeling of Online Communities for Fake News Detection.
- Davide Chicco and Giuseppe Jurman. 2020. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC Genomics, 21(1):6.
- Davide Chicco and Giuseppe Jurman. 2022. An Invitation to Greater Use of Matthews Correlation Coefficient in Robotics and Artificial Intelligence. Frontiers in Robotics and AI, 9.
- Detecting Hate Speech with GPT-3.
- Limeng Cui and Dongwon Lee. 2020. CoAID: COVID-19 Healthcare Misinformation Dataset.
- Carner Derron. 2021. Commission on Information Disorder Final Report. https://www.aspeninstitute.org/publications/commission-on-information-disorder-final-report/.
- MetaDetector: Meta Event Knowledge Transfer for Fake News Detection. ACM Transactions on Intelligent Systems and Technology.
- Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 1192–1197, Hong Kong, China. Association for Computational Linguistics.
- Brussels European Commission. 2018. Flash eurobarometer 464 (fake news and disinformation online). GESIS Data Archive, Cologne. ZA6934 Data file Version 1.0.0, https://doi.org/10.4232/1.13019.
- Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. CoRR.
- Peter Flach and Meelis Kull. 2015. Precision-Recall-Gain Curves: PR Analysis Done Right. In Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc.
- Fake News Detection Through Graph-based Neural Networks: A Survey.
- How does Truth Evolve into Fake News? An Empirical Study of Fake News Evolution. In Companion Proceedings of the Web Conference 2021, pages 407–411, Ljubljana Slovenia. ACM.
- Learning to Learn to Disambiguate: Meta-Learning for Few-Shot Word Sense Disambiguation.
- Benjamin Horne. 2019. NELA2017.
- Benjamin Horne and Mauricio Gruppi. 2021. NELA-GT-2020.
- Benjamin Horne and Mauricio Gruppi. 2023. NELA-GT-2022.
- Kexin Huang and Marinka Zitnik. 2021. Graph Meta Learning via Local Subgraphs.
- Meta-prompt based learning for low-resource false information detection. Information Processing & Management, 60(3):103279.
- The interplay between communities and homophily in semi-supervised classification using graph neural networks. Applied Network Science, 6(1):1–26.
- Hetero-SCAN: Towards Social Context Aware Fake News Detection via Heterogeneous Graph Neural Network. arXiv: Social and Information Networks.
- Diederik P. Kingma and Jimmy Ba. 2017. Adam: A Method for Stochastic Optimization.
- Meta Learning for Natural Language Processing: A Survey.
- Meta Learning and Its Applications to Natural Language Processing. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Tutorial Abstracts, pages 15–20, Online. Association for Computational Linguistics.
- Towards Few-shot Fact-Checking via Perplexity. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 1971–1981, Online. Association for Computational Linguistics.
- On Unifying Misinformation Detection.
- Large Scale Learning on Non-Homophilous Graphs: New Benchmarks and Strong Simple Methods.
- Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning. In Findings of the Association for Computational Linguistics: NAACL 2022, pages 2543–2556, Seattle, United States. Association for Computational Linguistics.
- RoBERTa: A Robustly Optimized BERT Pretraining Approach.
- Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization.
- Is Homophily a Necessity for Graph Neural Networks?
- MetaLearning with Graph Neural Networks: Methods and Applications. ACM SIGKDD Explorations Newsletter, 23(2):13–22.
- Tackling fake news detection by continually improving social context representations using graph neural networks. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1363–1380, Dublin, Ireland. Association for Computational Linguistics.
- Author Profiling for Abuse Detection. In Proceedings of the 27th International Conference on Computational Linguistics, pages 1088–1098, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
- Abusive Language Detection with Graph Convolutional Networks.
- Node masking: Making graph neural networks generalize and scale better. CoRR, abs/2001.07524.
- Modeling users and online communities for abuse detection: A position on ethics and explainability. In Findings of the Association for Computational Linguistics: EMNLP 2021, pages 3374–3385, Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Fake News Detection on Social Media using Geometric Deep Learning.
- Domain Adaptive Fake News Detection via Reinforcement Learning. In Proceedings of the ACM Web Conference 2022, pages 3632–3640.
- Karsten Müller and Carlo Schwarz. 2017. Fanning the flames of hate: Social media and hate crime. SSRN Electronic Journal.
- M. E. J. Newman. 2003. Mixing patterns in networks. Physical Review E, 67(2):026126.
- FANG: Leveraging social context for fake news detection using graph representation. Communications of the ACM, 65(4):124–132.
- On First-Order Meta-Learning Algorithms.
- Dan Saattrup Nielsen and Ryan McConville. 2022. MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset.
- BOIL: Towards Representation Change for Few-shot Learning.
- Fake news detection: A survey of graph neural network methods. Applied Soft Computing, 139:110235.
- Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML.
- Adversarial Active Learning based Heterogeneous Graph Neural Network for Fake News Detection.
- Studying Fake News Spreading, Polarisation Dynamics, and Manipulation by Bots: A Tale of Networks and Language. Computer Science Review, 47:100531.
- Meta-Analysis with R. Use R! Springer International Publishing, Cham.
- FakeNewsNet: A Data Repository with News Content, Social Context and Spatialtemporal Information for Studying Fake News on Social Media.
- Beyond News Contents: The Role of Social Context for Fake News Detection. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, pages 312–320, Melbourne VIC Australia. ACM.
- Prototypical Networks for Few-shot Learning.
- Temporally evolving graph neural network for fake news detection. Information Processing and Management: an International Journal, 58(6).
- Graph-based semi-supervised learning: A comprehensive review. CoRR, abs/2102.13303.
- Sharing news with online friends: A study of network homophily, network size, and news type. Telematics and Informatics, 67:101763.
- Botpercent: Estimating bot populations in twitter communities.
- Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples.
- Multilingual and cross-lingual document classification: A meta-learning approach. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 1966–1976, Online. Association for Computational Linguistics.
- Graph Attention Networks.
- The spread of true and false news online. Science, 359(6380):1146–1151.
- Zeerak Waseem and Dirk Hovy. 2016. Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter. In Proceedings of the NAACL Student Research Workshop, pages 88–93, San Diego, California. Association for Computational Linguistics.
- Daniel Wiessner. 2021. Judge OKs $85 mln settlement of Facebook moderators’ PTSD claims. Reuters.
- Diverse Few-Shot Text Classification with Multiple Metrics. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 1206–1215, New Orleans, Louisiana. Association for Computational Linguistics.
- Contrastive Domain Adaptation for Early Misinformation Detection: A Case Study on COVID-19. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management, pages 2423–2433, Atlanta GA USA. ACM.
- MetaAdapt: Domain Adaptive Few-Shot Misinformation Detection via Meta Learning.
- GraphSAINT: Graph Sampling Based Inductive Learning Method.
- Few-Shot Learning on Graphs.
- Learning to Detect Few-Shot-Few-Clue Misinformation.
- BDANN: BERT-Based Domain Adaptation Neural Network for Multi-Modal Fake News Detection. In 2020 International Joint Conference on Neural Networks (IJCNN), pages 1–8.
- Beyond Homophily in Graph Neural Networks: Current Limitations and Effective Designs.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.