ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks

Authors: Renshuai Tao, Manyi Le, Chuangchuang Tan, Huan Liu, Haotong Qin, Yao Zhao

Published: 2024-10-24 12:32:22+00:00

AI Summary

The paper introduces ODDN, a novel deepfake detection network addressing the scarcity of paired data in open-world scenarios on online social networks. ODDN uses open-world data aggregation (ODA) and compression-discard gradient correction (CGC) to effectively leverage both paired and unpaired data, improving robustness against compression variations.

Abstract

Despite significant advances in deepfake detection, handling varying image quality, especially due to different compressions on online social networks (OSNs), remains challenging. Current methods succeed by leveraging correlations between paired images, whether raw or compressed. However, in open-world scenarios, paired data is scarce, with compressed images readily available but corresponding raw versions difficult to obtain. This imbalance, where unpaired data vastly outnumbers paired data, often leads to reduced detection performance, as existing methods struggle without corresponding raw images. To overcome this issue, we propose a novel approach named the open-world deepfake detection network (ODDN), which comprises two core modules: open-world data aggregation (ODA) and compression-discard gradient correction (CGC). ODA effectively aggregates correlations between compressed and raw samples through both fine-grained and coarse-grained analyses for paired and unpaired data, respectively. CGC incorporates a compression-discard gradient correction to further enhance performance across diverse compression methods in OSN. This technique optimizes the training gradient to ensure the model remains insensitive to compression variations. Extensive experiments conducted on 17 popular deepfake datasets demonstrate the superiority of the ODDN over SOTA baselines.


Key findings
ODDN significantly outperforms SOTA baselines on 17 datasets in both quality-aware and quality-agnostic settings. Ablation studies confirm the contributions of ODA and CGC. Feature visualization shows ODDN's improved ability to separate features of real and fake images.
Approach
ODDN tackles the unpaired data challenge with two modules: ODA aggregates correlations between compressed and raw images (fine-grained for paired, coarse-grained for unpaired data). CGC corrects gradients to make the model insensitive to compression variations, using PCGrad to align gradients from different loss functions.
Datasets
17 popular GAN-based datasets (8 from ForenSynths, 9 from GANGen-Detection), along with ForenSynths training data (real images from LSUN, synthetic images from ProGAN).
Model(s)
ResNet-50 (as the encoder), with additional modules for ODA and CGC (including self-attention and PCGrad).
Author countries
China, Switzerland