ED$^4$: Explicit Data-level Debiasing for Deepfake Detection

Authors: Jikang Cheng, Ying Zhang, Qin Zou, Zhiyuan Yan, Chao Liang, Zhongyuan Wang, Chen Li

Published: 2024-08-13 10:05:20+00:00

AI Summary

This paper introduces ED⁴, a data-level debiasing strategy for deepfake detection that addresses content, specific-forgery, and a novel spatial bias. ED⁴ uses ClockMix for data augmentation and an Adversarial Spatial Consistency Module (AdvSCM) to mitigate spatial bias, significantly improving generalization.

Abstract

Learning intrinsic bias from limited data has been considered the main reason for the failure of deepfake detection with generalizability. Apart from the discovered content and specific-forgery bias, we reveal a novel spatial bias, where detectors inertly anticipate observing structural forgery clues appearing at the image center, also can lead to the poor generalization of existing methods. We present ED$^4$, a simple and effective strategy, to address aforementioned biases explicitly at the data level in a unified framework rather than implicit disentanglement via network design. In particular, we develop ClockMix to produce facial structure preserved mixtures with arbitrary samples, which allows the detector to learn from an exponentially extended data distribution with much more diverse identities, backgrounds, local manipulation traces, and the co-occurrence of multiple forgery artifacts. We further propose the Adversarial Spatial Consistency Module (AdvSCM) to prevent extracting features with spatial bias, which adversarially generates spatial-inconsistent images and constrains their extracted feature to be consistent. As a model-agnostic debiasing strategy, ED$^4$ is plug-and-play: it can be integrated with various deepfake detectors to obtain significant benefits. We conduct extensive experiments to demonstrate its effectiveness and superiority over existing deepfake detection approaches.


Key findings
ED⁴ outperforms state-of-the-art deepfake detection methods across multiple datasets. Ablation studies demonstrate the effectiveness of both ClockMix and AdvSCM. The plug-and-play nature of ED⁴ allows for significant performance improvements when integrated with existing detectors.
Approach
ED⁴ explicitly debiases deepfake detection at the data level using two modules: ClockMix, which creates diverse mixtures of images to address content and specific-forgery biases, and AdvSCM, which adversarially generates spatially inconsistent images to mitigate spatial bias. This is a plug-and-play method that can be integrated with various detectors.
Datasets
FaceForensics++ (FF++) (HQ), Celeb-DF-v1, Celeb-DF-v2, DeepFakeDetection (DFD), DeepFake Detection Challenge (DFDC), DF40, DiffusionFace, WildDeepfake, FakeAVCeleb, E4S, BlendFace, UniFace, DeepfaceLab, DiffSwap, SDv21
Model(s)
Xception (used as backbone for both adversarial generator and extractor in ED⁴), EfficientNetB4, SPSL, UCF, DCL, TALL, CDFA, RAM (used as baselines)
Author countries
China