Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics

Published 21 Mar 2024 in cs.AI and cs.CR | (2403.14077v4)

Abstract: DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means for disinformation. Detecting DeepFakes is currently solved with programmed machine learning algorithms. In this work, we investigate the capabilities of multimodal LLMs in DeepFake detection. We conducted qualitative and quantitative experiments to demonstrate multimodal LLMs and show that they can expose AI-generated images through careful experimental design and prompt engineering. This is interesting, considering that LLMs are not inherently tailored for media forensic tasks, and the process does not require programming. We discuss the limitations of multimodal LLMs for these tasks and suggest possible improvements.

Abstract PDF HTML Upgrade to Chat

References (41)

Citations (15)

View on Semantic Scholar

Summary

The paper demonstrates that leveraging multimodal inputs enables ChatGPT to detect AI-generated face images with a 75% AUC score.
The paper shows that refined prompt engineering reduces rejection rates and enhances detection accuracy in media forensics.
The paper implies that integrating LLMs into forensic workflows can offer user-friendly tools, though improvements are needed for real image identification.

Evaluating ChatGPT for DeepFake Detection in Media Forensics

The paper, "Can ChatGPT Detect DeepFakes? A Study of Using Multimodal LLMs for Media Forensics," explores the potential of employing multimodal LLMs for the detection of DeepFakes, a prevalent concern in the digital era. This work critically examines whether the capabilities of LLMs, specifically those that can process both textual and visual inputs, could be harnessed for media forensics without relying on complex programming.

Methodology and Experimental Design

The researchers focus on ChatGPT, particularly its iterations that can handle multimodal data, such as images and text. The study opts for identifying AI-generated face images, which are among the earliest and most notorious forms of DeepFakes produced through models like GANs and diffusion models. The authors systematically design experiments leveraging the GPT4V model, assessing its ability to determine whether a given image is AI-generated using various types of prompts.

A significant aspect of the paper is the exploration of different prompts, ranging from simple binary queries to more context-rich ones. The study highlights that simpler prompts often lead to higher rejection rates by the LLMs, thereby necessitating more intricate prompt engineering for effective DeepFake detection. This approach suggests that the inherent capabilities of LLMs, which are not originally tailored for media forensics, can still be deployed effectively with carefully designed user-text interfaces.

Key Findings and Performance Metrics

The study reports an AUC score of approximately 75% when using multimodal LLMs for detecting AI-generated face images. This AUC score indicates the LLMs' discernment capabilities, albeit limited compared to advanced programmed methodologies. While the model shows satisfactory performance in detecting AI-generated images, its accuracy drops significantly when identifying real images, owing to the absence of explicit semantic inconsistencies typically associated with AI-generated content.

The rejection rate of the LLM's responses plays a critical role in evaluating its practical utility. The paper emphasizes that effective prompt engineering can notably reduce rejection rates and improve the accuracy of the model's predictions. Through iterative querying and context-providing prompts, the efficacy of LLMs can be enhanced to discern between authentic and AI-generated content more reliably.

Implications and Speculative Future Directions

This research demonstrates the prospect of integrating LLMs into media forensics, offering a user-friendly, intuitive interface without requiring deep programming expertise. The study underscores that while LLMs can be a part of the media forensics toolkit, they currently lag behind traditional methods in accuracy. This shortfall is partly due to LLMs' reliance on semantic reasoning rather than signal-level details, which are more targeted in current DeepFake detection tools.

The paper speculates on the future development of LLMs in this domain, suggesting enhancements such as improved prompting strategies, incorporation of feedback mechanisms, and hybrid integration with established signal-processing techniques. These developments would potentially increase the LLMs’ utility in comprehensive media forensic tasks, including video analysis and the identification of text-image mis-contextualization.

Conclusion

In conclusion, the study provides a thorough examination of ChatGPT's abilities to detect DeepFakes, contributing valuable insights to the field of media forensics. While current performance may not compete with specialized detection systems, the promise of LLMs lies in their ease of accessibility and the significant potential to evolve. This paper encourages further exploration toward utilizing multimodal LLMs, enhancing their precision, and integrating them effectively with existing forensic methodologies for more robust media analysis solutions.