A Novel Unified Approach to Deepfake Detection
Authors: Lord Sen, Shyamapada Mukherjee
Published: 2026-01-06 19:30:53+00:00
AI Summary
This paper introduces a novel architecture for Deepfake detection in images and videos. The proposed method utilizes cross-attention between spatial and frequency domain features, augmented with a blood detection module, to classify content as real or fake. It achieves state-of-the-art results, including 99.80% and 99.88% AUC on FF++ and Celeb-DF datasets, and demonstrates strong generalization across datasets.
Abstract
The advancements in the field of AI is increasingly giving rise to various threats. One of the most prominent of them is the synthesis and misuse of Deepfakes. To sustain trust in this digital age, detection and tagging of deepfakes is very necessary. In this paper, a novel architecture for Deepfake detection in images and videos is presented. The architecture uses cross attention between spatial and frequency domain features along with a blood detection module to classify an image as real or fake. This paper aims to develop a unified architecture and provide insights into each step. Though this approach we achieve results better than SOTA, specifically 99.80%, 99.88% AUC on FF++ and Celeb-DF upon using Swin Transformer and BERT and 99.55, 99.38 while using EfficientNet-B4 and BERT. The approach also generalizes very well achieving great cross dataset results as well.