Practical Deepfake Detection: Vulnerabilities in Global Contexts

Authors: Yang A. Chuming, Daniel J. Wu, Ken Hong

Published: 2022-06-20 15:24:55+00:00

Comment: 6 pages, 6 figures, presented as a workshop paper at Responsible AI @ ICLR 2021

AI Summary

This paper investigates the practical efficacy of state-of-the-art deepfake detection algorithms when faced with real-world video quality degradation. By simulating various data corruption techniques on the FaceForensics++ dataset, the study reveals that while models are robust to corruptions resembling training augmentations, they remain vulnerable to significant decreases in video quality. This vulnerability can lead to misclassifying legitimate videos as deepfakes, as demonstrated with Gabonese President Bongo's new year address.

Abstract

Recent advances in deep learning have enabled realistic digital alterations to videos, known as deepfakes. This technology raises important societal concerns regarding disinformation and authenticity, galvanizing the development of numerous deepfake detection algorithms. At the same time, there are significant differences between training data and in-the-wild video data, which may undermine their practical efficacy. We simulate data corruption techniques and examine the performance of a state-of-the-art deepfake detection algorithm on corrupted variants of the FaceForensics++ dataset. While deepfake detection models are robust against video corruptions that align with training-time augmentations, we find that they remain vulnerable to video corruptions that simulate decreases in video quality. Indeed, in the controversial case of the video of Gabonese President Bongo's new year address, the algorithm, which confidently authenticates the original video, judges highly corrupted variants of the video to be fake. Our work opens up both technical and ethical avenues of exploration into practical deepfake detection in global contexts.


Key findings
Deepfake detection models are robust against video corruptions that align with training-time augmentations, but remain highly vulnerable to corruptions simulating severe decreases in video quality, such as high Constant Rate Factor (CRF) compression and datamoshing. These vulnerabilities can lead to legitimate videos being incorrectly classified as fake, with potentially dangerous socio-political implications in global contexts with poor infrastructure.
Approach
The authors developed a video corruption pipeline to simulate quality degradation (e.g., bitrate reduction, resolution downsampling, CRF increase, datamoshing) on videos from the FaceForensics++ dataset. They then evaluated the performance of a pre-trained, state-of-the-art deepfake detection model (an ensemble of seven EfficientNet B7s) on these corrupted legitimate and deepfake videos.
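The paper does not publish its pipeline code, but the non-datamoshing corruptions it describes map naturally onto standard ffmpeg re-encoding options. The sketch below is a hypothetical illustration, assuming ffmpeg is available: each helper builds the command for one corruption (the function names, parameter defaults, and file paths are illustrative assumptions, not the authors' implementation).

```python
# Hypothetical sketch of a video corruption pipeline like the one the
# authors describe. Each helper returns an ffmpeg command list that
# re-encodes a source video with a single quality-degrading corruption.
# Parameter defaults below are illustrative, not values from the paper.

def reduce_bitrate(src: str, dst: str, kbps: int = 100) -> list[str]:
    # Cap the video bitrate to simulate low-bandwidth re-encoding.
    return ["ffmpeg", "-i", src, "-b:v", f"{kbps}k", dst]

def downsample_resolution(src: str, dst: str, width: int = 256) -> list[str]:
    # Scale down spatial resolution; "-1" preserves the aspect ratio.
    return ["ffmpeg", "-i", src, "-vf", f"scale={width}:-1", dst]

def increase_crf(src: str, dst: str, crf: int = 40) -> list[str]:
    # Higher CRF means more aggressive lossy H.264 compression
    # (the x264 CRF scale runs 0-51; ~23 is the default).
    return ["ffmpeg", "-i", src, "-c:v", "libx264", "-crf", str(crf), dst]

if __name__ == "__main__":
    # Example: build (but don't run) a high-CRF corruption command.
    cmd = increase_crf("original.mp4", "corrupted.mp4", crf=46)
    print(" ".join(cmd))
```

Datamoshing is omitted here because it involves deliberately deleting keyframes rather than a single re-encode flag. In practice, each corrupted variant would then be fed to the pre-trained detector to measure how its real/fake scores shift with degradation severity.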
Datasets
FaceForensics++; Deepfake Detection Challenge (DFDC) dataset (used for model training)
Model(s)
Ensemble of 7 EfficientNet B7s
Author countries
United States