Pay Less Attention to Deceptive Artifacts: Robust Detection of Compressed Deepfakes on Online Social Networks

View on arXiv ← Back to list

Authors: Manyi Li, Renshuai Tao, Yufan Liu, Chuangchuang Tan, Haotong Qin, Bing Li, Yunchao Wei, Yao Zhao

Published: 2025-06-25 15:46:41+00:00

AI Summary

This paper proposes PLADA, a novel deepfake detection framework that addresses the challenge of detecting compressed deepfakes on online social networks. PLADA utilizes a dual-stage attention mechanism to handle compression artifacts and incorporates both paired and unpaired data for improved detection, outperforming state-of-the-art methods.

Abstract

With the rapid advancement of deep learning, particularly through generative adversarial networks (GANs) and diffusion models (DMs), AI-generated images, or ``deepfakes, have become nearly indistinguishable from real ones. These images are widely shared across Online Social Networks (OSNs), raising concerns about their misuse. Existing deepfake detection methods overlook the ``block effects introduced by compression in OSNs, which obscure deepfake artifacts, and primarily focus on raw images, rarely encountered in real-world scenarios. To address these challenges, we propose PLADA (Pay Less Attention to Deceptive Artifacts), a novel framework designed to tackle the lack of paired data and the ineffective use of compressed images. PLADA consists of two core modules: Block Effect Eraser (B2E), which uses a dual-stage attention mechanism to handle block effects, and Open Data Aggregation (ODA), which processes both paired and unpaired data to improve detection. Extensive experiments across 26 datasets demonstrate that PLADA achieves a remarkable balance in deepfake detection, outperforming SoTA methods in detecting deepfakes on OSNs, even with limited paired data and compression. More importantly, this work introduces the ``block effect as a critical factor in deepfake detection, providing a robust solution for open-world scenarios. Our code is available at https://github.com/ManyiLee/PLADA.

Key findings

PLADA significantly outperforms state-of-the-art methods in detecting compressed deepfakes on online social networks, even with limited paired data. The results show superior performance across various GAN and DM-generated datasets and robustness across different compression levels. The 'block effect' introduced by compression is identified as a key challenge in deepfake detection.

Approach

PLADA consists of two modules: Block Effect Eraser (B2E) which uses a dual-stage attention mechanism to mitigate compression artifacts, and Open Data Aggregation (ODA) which processes both paired and unpaired data to improve detection robustness. Adversarial learning is used to guide attention towards deepfake artifacts.

Datasets

ForenSynths, GANGen-Detection, Ojha-test, DiffusionForensics (26 datasets in total)

Model(s)

CLIP (with modifications using B2E and ODA)

Author countries

China

← Previous