Adversarially robust deepfake media detection using fused convolutional neural network predictions

Authors: Sohail Ahmed Khan, Alessandro Artusi, Hang Dai

Published: 2021-02-11 11:28:00+00:00

AI Summary

This paper proposes a deepfake video detection method that fuses predictions from three CNN models (VGG16, InceptionV3, and XceptionNet) to improve robustness and generalization. The fusion approach outperforms state-of-the-art methods on several datasets, achieving high accuracy even against adversarial attacks.

Abstract

Deepfakes are synthetically generated images, videos or audios, which fraudsters use to manipulate legitimate information. Current deepfake detection systems struggle against unseen data. To address this, we employ three different deep Convolutional Neural Network (CNN) models, (1) VGG16, (2) InceptionV3, and (3) XceptionNet to classify fake and real images extracted from videos. We also constructed a fusion of the deep CNN models to improve the robustness and generalisation capability. The proposed technique outperforms state-of-the-art models with 96.5% accuracy, when tested on publicly available DeepFake Detection Challenge (DFDC) test data, comprising of 400 videos. The fusion model achieves 99% accuracy on lower quality DeepFake-TIMIT dataset videos and 91.88% on higher quality DeepFake-TIMIT videos. In addition to this, we prove that prediction fusion is more robust against adversarial attacks. If one model is compromised by an adversarial attack, the prediction fusion does not let it affect the overall classification.


Key findings
The fused prediction model achieved 96.5% accuracy on the DFDC test set, outperforming state-of-the-art models. On the DeepFake-TIMIT dataset, it achieved 99% accuracy on low-resolution videos and 91.88% on high-resolution videos. The fusion significantly improved robustness against adversarial attacks.
Approach
The authors train three different CNN architectures on a subset of the DFDC dataset, employing various image augmentations. Predictions from these models are then fused by averaging frame-by-frame probabilities to classify videos as real or fake. This fusion improves robustness against adversarial attacks.
Datasets
DeepFake Detection Challenge (DFDC) dataset, DeepFake-TIMIT dataset
Model(s)
VGG16, InceptionV3, XceptionNet
Author countries
Cyprus, UAE