HyperFake: Hyperspectral Reconstruction and Attention-Guided Analysis for Advanced Deepfake Detection

Authors: Pavan C Shekar, Pawan Soni, Vivek Kanhangad

Published: 2025-05-24 08:28:55+00:00

AI Summary

HyperFake is a novel deepfake detection pipeline that reconstructs hyperspectral data from RGB videos to reveal manipulation traces invisible to traditional methods. It uses an improved MST++ architecture for reconstruction, a spectral attention mechanism for feature selection, and an EfficientNet-based classifier for improved accuracy and generalization.

Abstract

Deepfakes pose a significant threat to digital media security, with current detection methods struggling to generalize across different manipulation techniques and datasets. While recent approaches combine CNN-based architectures with Vision Transformers or leverage multi-modal learning, they remain limited by the inherent constraints of RGB data. We introduce HyperFake, a novel deepfake detection pipeline that reconstructs 31-channel hyperspectral data from standard RGB videos, revealing hidden manipulation traces invisible to conventional methods. Using an improved MST++ architecture, HyperFake enhances hyperspectral reconstruction, while a spectral attention mechanism selects the most critical spectral features for deepfake detection. The refined spectral data is then processed by an EfficientNet-based classifier optimized for spectral analysis, enabling more accurate and generalizable detection across different deepfake styles and datasets, all without the need for expensive hyperspectral cameras. To the best of our knowledge, this is the first approach to leverage hyperspectral imaging reconstruction for deepfake detection, opening new possibilities for detecting increasingly sophisticated manipulations.


Key findings
HyperFake achieved 98.94% training accuracy and 92% validation accuracy on the FaceForensics++ dataset, significantly outperforming ResNet-50 and EfficientNet-B7 baselines. This demonstrates the effectiveness of using hyperspectral reconstruction for deepfake detection.
Approach
HyperFake reconstructs 31-channel hyperspectral data from RGB videos using an enhanced MST++ model. A spectral attention mechanism then selects the most important spectral features, which are finally classified using an EfficientNet-based classifier.
Datasets
FaceForensics++
Model(s)
Improved MST++, EfficientNet-B0
Author countries
India