A Unified Framework for Stealthy Adversarial Generation via Latent Optimization and Transferability Enhancement
Authors: Gaozheng Pei, Ke Ma, Dongpeng Zhang, Chengzhi Sun, Qianqian Xu, Qingming Huang
Published: 2025-06-30 09:59:09+00:00
AI Summary
This paper proposes a unified framework for generating stealthy adversarial examples against Deepfake detectors. The framework integrates traditional transferability enhancement strategies into diffusion model-based adversarial example generation, which improves generalization beyond conventional image classification. The method won first place in the adversarial attack competition on Deepfake detectors at ACM MM25.
Abstract
Diffusion-based methods that generate adversarial examples through image editing are rapidly gaining popularity thanks to their powerful image generation capabilities. However, because they rely on the discriminative capability of the diffusion model, these methods often struggle to generalize beyond conventional image classification to tasks such as Deepfake detection. Moreover, traditional strategies for enhancing adversarial example transferability are difficult to adapt to them. To address these challenges, we propose a unified framework that incorporates traditional transferability enhancement strategies into diffusion model-based adversarial example generation via image editing, enabling their application across a wider range of downstream tasks. Our method won first place in the competition "1st Adversarial Attacks on Deepfake Detectors: A Challenge in the Era of AI-Generated Media" at ACM MM25, which validates the effectiveness of our approach.