Hiding Faces in Plain Sight: Defending DeepFakes by Disrupting Face Detection

View on arXiv ← Back to list

Authors: Delong Zhu, Yuezun Li, Baoyuan Wu, Jiaran Zhou, Zhibo Wang, Siwei Lyu

Published: 2024-12-02 04:17:48+00:00

AI Summary

This paper proposes FacePoison, a proactive DeepFake defense framework that hinders DeepFake generation by disrupting face detection, a crucial preprocessing step in most DeepFake methods. It introduces VideoFacePoison, an efficient extension that propagates adversarial perturbations across video frames, reducing computational overhead.

Abstract

This paper investigates the feasibility of a proactive DeepFake defense framework, {em FacePosion}, to prevent individuals from becoming victims of DeepFake videos by sabotaging face detection. The motivation stems from the reliance of most DeepFake methods on face detectors to automatically extract victim faces from videos for training or synthesis (testing). Once the face detectors malfunction, the extracted faces will be distorted or incorrect, subsequently disrupting the training or synthesis of the DeepFake model. To achieve this, we adapt various adversarial attacks with a dedicated design for this purpose and thoroughly analyze their feasibility. Based on FacePoison, we introduce {em VideoFacePoison}, a strategy that propagates FacePoison across video frames rather than applying them individually to each frame. This strategy can largely reduce the computational overhead while retaining the favorable attack performance. Our method is validated on five face detectors, and extensive experiments against eleven different DeepFake models demonstrate the effectiveness of disrupting face detectors to hinder DeepFake generation.

Key findings

Experiments show FacePoison effectively disrupts five face detectors, reducing their F1-scores significantly. VideoFacePoison successfully propagates adversarial perturbations with reduced computational overhead. The approach effectively degrades the visual quality of DeepFake faces generated by eleven different models in both training and inference phases.

Approach

FacePoison adds imperceptible adversarial perturbations to images to disrupt DNN-based face detectors. VideoFacePoison extends this by propagating perturbations across video frames using optical flow, reducing computational cost.

Datasets

WIDER, FaceForensics++, Celeb-DF, DeepFake Detection Challenge (DFDC), preview version of DFDC (DFDCP)

Model(s)

RetinaFace, YOLO5Face, PyramidBox, S3FD, DSFD, SimSwap, InfoSwap, MobileFaceSwap, BlendFace, FaceSwap (Origin, DFaker, IAE, LightWeight, DFLH), CDFv1, CDFv2

Author countries

China, Hong Kong, USA

← Previous