Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media
Abstract: We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the contextual relationships in the discussion surrounding a comment and grounding the interwoven fusion layers that combine text and image embeddings instead of processing modalities separately. To evaluate our work, we present a new dataset, HatefulDiscussions, comprising complete multi-modal discussions from multiple online communities on Reddit. We compare the performance of our model to baselines that only process individual comments and conduct extensive ablation studies.
- Prompting for Multimodal Hateful Meme Classification. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 321–332. Abu Dhabi, United Arab Emirates: Association for Computational Linguistics.
- HateBERT: Retraining BERT for Abusive Language Detection in English. In Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021), 17–25. Online: Association for Computational Linguistics.
- Title-and-Tag Contrastive Vision-and-Language Transformer for Social Media Popularity Prediction. In Proceedings of the 30th ACM International Conference on Multimedia, MM ’22, 7008–7012. New York, NY, USA: Association for Computing Machinery. ISBN 9781450392037.
- Hate Speech in Online Social Media. SIGWEB Newsl., (Autumn).
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding.
- Large Scale Crowdsourcing and Characterization of Twitter Abusive Behavior. Proceedings of the International AAAI Conference on Web and Social Media, 12(1).
- Abusive language detection in heterogeneous contexts: Dataset collection and the role of supervised attention. In Proceedings of the AAAI Conference on Artificial Intelligence (AISI Track), volume 35, 14804–14812.
- Hanu, L.; and Unitary team. 2020. Detoxify. Github. https://github.com/unitaryai/detoxify.
- Qualitative Analysis of a Graph Transformer Approach to Addressing Hate Speech: Adapting to Dynamically Changing Content. Workshop in Artificial Intelligence for Social Good (AISG) at AAAI.
- Predicting Hateful Discussions on Reddit using Graph Transformer Networks and Communal Context. In 2022 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT), 9–17.
- Sentiment analysis on product reviews using machine learning techniques. In Cognitive Informatics and Soft Computing: Proceeding of CISC 2017, 639–647. Springer.
- The hateful memes challenge: Detecting hate speech in multimodal memes.
- ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision. In Proceedings of the 38th International Conference on Machine Learning, 5583–5594. PMLR. ISSN: 2640-3498.
- Towards a Comprehensive Taxonomy and Large-Scale Annotated Corpus for Online Slur Usage. In Proceedings of the Fourth Workshop on Online Abuse and Harms, 138–149. Online: Association for Computational Linguistics.
- Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 1767–1777. Dublin, Ireland: Association for Computational Linguistics.
- ViLBERT: pretraining task-agnostic visiolinguistic representations for vision-and-language tasks. In Proceedings of the 33rd International Conference on Neural Information Processing Systems, 2, 13–23. Red Hook, NY, USA: Curran Associates Inc.
- Understanding Political Polarization via Jointly Modeling Users, Connections and Multimodal Contents on Heterogeneous Graphs. In Proceedings of the 30th ACM International Conference on Multimedia, MM ’22, 4072–4082. New York, NY, USA: Association for Computing Machinery. ISBN 9781450392037.
- Hatexplain: A benchmark dataset for explainable hate speech detection. In Proceedings of the AAAI Conference on Artificial Intelligence (AISI Track), volume 35, 14867–14875.
- Meta. 2023. Meta Reports First Quarter 2023 Results. Meta Investor Relations. https://investor.fb.com/investor-news/press-release-details/2023/Meta-Reports-First-Quarter-2023-Results/default.aspx.
- Tweetnerd-end to end entity linking benchmark for tweets. Advances in Neural Information Processing Systems, 35: 1419–1433.
- Attention Bottlenecks for Multimodal Fusion. In Ranzato, M.; Beygelzimer, A.; Dauphin, Y.; Liang, P.; and Vaughan, J. W., eds., Advances in Neural Information Processing Systems, volume 34, 14200–14213. Curran Associates, Inc.
- Learning Reddit User Reputation Using Graphical Attention Networks. In Future Technologies Conference (FTC) 2020, Volume 1, 777–789. ISBN 978-3-030-63128-4.
- A Benchmark Dataset for Learning to Intervene in Online Hate Speech. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 4755–4764. Hong Kong, China: Association for Computational Linguistics.
- Learning transferable visual models from natural language supervision. In International conference on machine learning, 8748–8763. PMLR.
- Towards a multi-agent system for online hate speech detection. Second Workshop on Autonomous Agents for Social Good (AASG), AAMAS, 2021.
- Sentiment and Emotion-Aware Multi-Modal Complaint Identification. Proceedings of the AAAI Conference on Artificial Intelligence (AISI Track), 36(11): 12163–12171.
- An Efficient Multi-View Multimodal Data Processing Framework for Social Media Popularity Prediction. In Proceedings of the 30th ACM International Conference on Multimedia, MM ’22, 7200–7204. New York, NY, USA: Association for Computing Machinery. ISBN 9781450392037.
- Hate speech harms: a social justice discussion of disabled Norwegians’ experiences. Disability & Society, 34(3): 368–383.
- Introducing CAD: the Contextual Abuse Dataset. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2289–2303. Online: Association for Computational Linguistics.
- Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), 1667–1682. Online: Association for Computational Linguistics.
- Online hate speech victimization and depressive symptoms among adolescents: The protective role of resilience. Cyberpsychology, Behavior, and Social Networking.
- Quantifying social organization and political polarization in online platforms. Nature, 600(7888): 264–268.
- Do transformers really perform badly for graph representation? Advances in Neural Information Processing Systems, 34: 28877–28888.
- Predicting the Type and Target of Offensive Posts in Social Media. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 1415–1420. Minneapolis, Minnesota: Association for Computational Linguistics.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.