Investigation of ensemble methods for the detection of deepfake face manipulations

View on arXiv ← Back to list

Authors: Nikolaos Giatsoglou, Symeon Papadopoulos, Ioannis Kompatsiaris

Published: 2023-04-14 21:18:51+00:00

AI Summary

This paper investigates the effectiveness of ensemble methods for deepfake face manipulation detection. The authors explore and compare multiple ensemble detector designs to achieve robustness and good generalization, leveraging models specializing in different manipulation categories. Results show that ensembles can surpass individual models in accuracy but require extensive, diverse training data for strong generalization.

Abstract

The recent wave of AI research has enabled a new brand of synthetic media, called deepfakes. Deepfakes have impressive photorealism, which has generated exciting new use cases but also raised serious threats to our increasingly digital world. To mitigate these threats, researchers have tried to come up with new methods for deepfake detection that are more effective than traditional forensics and heavily rely on deep AI technology. In this paper, following up on encouraging prior work for deepfake detection with attribution and ensemble techniques, we explore and compare multiple designs for ensemble detectors. The goal is to achieve robustness and good generalization ability by leveraging ensembles of models that specialize in different manipulation categories. Our results corroborate that ensembles can achieve higher accuracy than individual models when properly tuned, while the generalization ability relies on access to a large number of training data for a diverse set of known manipulations.

Key findings

Properly tuned ensembles achieved higher accuracy than individual models on the FaceForensics++ dataset. Generalization to unseen datasets was poor, highlighting the need for more diverse training data. Multiclass attribution models and one-vs-rest ensembles showed promise for improved generalization.

Approach

The authors use an ensemble of EfficientNet-B0 models for frame-level deepfake detection. They explore different ensemble designs, including binary detection, multiclass attribution, one-vs-real, and one-vs-rest approaches. The output of individual models is soft-combined to produce a final classification.

Datasets

FaceForensics++, Celeb-DF, DFDC preview, DFDC, OpenForensics

Model(s)

EfficientNet-B0

Author countries

Greece

← Previous