Do Deepfake Detectors Work in Reality?

Authors: Simiao Ren, Hengwei Xu, Tsang Ng, Kidus Zewde, Shengkai Jiang, Ramini Desai, Disha Patil, Ning-Yau Cheng, Yining Zhou, Ragavi Muthukrishnan

Published: 2025-02-15 22:38:40+00:00

AI Summary

This research reveals that post-processing, specifically super-resolution, significantly reduces the effectiveness of existing deepfake detection methods. The authors introduce the first real-world faceswap dataset and demonstrate that state-of-the-art detectors perform poorly on this data, highlighting a critical gap between academic research and real-world applications.

Abstract

Deepfakes, particularly those involving faceswap-based manipulations, have sparked significant societal concern due to their increasing realism and potential for misuse. Despite rapid advancements in generative models, detection methods have not kept pace, creating a critical gap in defense strategies. This disparity is further amplified by the disconnect between academic research and real-world applications, which often prioritize different objectives and evaluation criteria. In this study, we take a pivotal step toward bridging this gap by presenting a novel observation: the post-processing step of super-resolution, commonly employed in real-world scenarios, substantially undermines the effectiveness of existing deepfake detection methods. To substantiate this claim, we introduce and publish the first real-world faceswap dataset, collected from popular online faceswap platforms. We then qualitatively evaluate the performance of state-of-the-art deepfake detectors on real-world deepfakes, revealing that their accuracy approaches the level of random guessing. Furthermore, we quantitatively demonstrate the significant performance degradation caused by common post-processing techniques. By addressing this overlooked challenge, our study underscores a critical avenue for enhancing the robustness and practical applicability of deepfake detection methods in real-world settings.


Key findings
State-of-the-art deepfake detectors achieve near-random accuracy on the real-world faceswap dataset. Super-resolution post-processing significantly degrades the performance of these detectors. The gap between academic research and real-world applications in deepfake detection is substantial.
Approach
The authors created a new dataset of real-world deepfakes collected from online faceswap platforms. They then evaluated state-of-the-art deepfake detectors on this dataset and quantitatively analyzed the performance degradation caused by super-resolution post-processing.
Datasets
Real-World Faceswap (RWFS) dataset (created by authors), Celeb dataset (for real images), FaceForensics++ (FF++)
Model(s)
EfficientNet-B4, Self-blended images model, GFPGAN, CodeFormer (for super-resolution)
Author countries
USA