GazeForensics: DeepFake Detection via Gaze-guided Spatial Inconsistency Learning

Authors: Qinlin He, Chunlei Peng, Decheng Liu, Nannan Wang, Xinbo Gao

Published: 2023-11-13 04:48:33+00:00

AI Summary

GazeForensics is a novel deepfake detection method that uses gaze representation from a 3D gaze estimation model to regularize the deepfake detection model, while also integrating general features. Experimental results show GazeForensics outperforms state-of-the-art methods.

Abstract

DeepFake detection is pivotal in personal privacy and public safety. With the iterative advancement of DeepFake techniques, high-quality forged videos and images are becoming increasingly deceptive. Prior research has seen numerous attempts by scholars to incorporate biometric features into the field of DeepFake detection. However, traditional biometric-based approaches tend to segregate biometric features from general ones and freeze the biometric feature extractor. These approaches resulted in the exclusion of valuable general features, potentially leading to a performance decline and, consequently, a failure to fully exploit the potential of biometric information in assisting DeepFake detection. Moreover, insufficient attention has been dedicated to scrutinizing gaze authenticity within the realm of DeepFake detection in recent years. In this paper, we introduce GazeForensics, an innovative DeepFake detection method that utilizes gaze representation obtained from a 3D gaze estimation model to regularize the corresponding representation within our DeepFake detection model, while concurrently integrating general features to further enhance the performance of our model. Experiment results reveal that our proposed GazeForensics outperforms the current state-of-the-art methods.


Key findings
GazeForensics achieved state-of-the-art performance on FaceForensics++ and WildDeepfake datasets. It showed consistent accuracy across different video qualities. The ablation study demonstrated the significant contribution of the MSE gaze constraint and leaky feature fusion.
Approach
GazeForensics integrates a pre-trained 3D gaze estimation model with a deepfake detection model. It uses a mean squared error (MSE) loss to regularize the detection model with gaze features and incorporates 'leaky' features to preserve general information, improving accuracy and robustness.
Datasets
FaceForensics++, Celeb-DF, WildDeepfake, Gaze360
Model(s)
ResNet-18 (for both DeepFake detection and gaze estimation backends), L2CS-Net (for 3D gaze estimation pre-training)
Author countries
China