Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics
Abstract: DeepFakes, which refer to AI-generated media content, have become a growing concern because of their use as a vehicle for disinformation. DeepFake detection is currently handled by programmed machine learning algorithms. In this work, we investigate the capabilities of multimodal large language models (LLMs) in DeepFake detection. We conducted qualitative and quantitative experiments to evaluate multimodal LLMs and show that, with careful experimental design and prompt engineering, they can expose AI-generated images. This is notable because LLMs are not inherently tailored to media forensic tasks and the process requires no programming. We discuss the limitations of multimodal LLMs for these tasks and suggest possible improvements.