WMamba: Wavelet-based Mamba for Face Forgery Detection
Authors: Siran Peng, Tianshuo Zhang, Li Gao, Xiangyu Zhu, Haoyuan Zhang, Kai Pang, Zhen Lei
Published: 2025-01-16 15:44:24+00:00
Comment: Accepted by ACM MM 2025
AI Summary
This paper introduces WMamba, a novel wavelet-based feature extractor built upon the Mamba architecture for robust face forgery detection. It enhances forgery detection by proposing Dynamic Contour Convolution (DCConv) to adaptively model slender facial contours and leveraging the Mamba architecture to capture long-range spatial relationships with linear complexity. WMamba effectively extracts fine-grained, globally distributed forgery artifacts, achieving state-of-the-art performance.
Abstract
The rapid evolution of deepfake generation technologies necessitates the development of robust face forgery detection algorithms. Recent studies have demonstrated that wavelet analysis can enhance the generalization abilities of forgery detectors. Wavelets effectively capture key facial contours, often slender, fine-grained, and globally distributed, that may conceal subtle forgery artifacts imperceptible in the spatial domain. However, current wavelet-based approaches fail to fully exploit the distinctive properties of wavelet data, resulting in sub-optimal feature extraction and limited performance gains. To address this challenge, we introduce WMamba, a novel wavelet-based feature extractor built upon the Mamba architecture. WMamba maximizes the utility of wavelet information through two key innovations. First, we propose Dynamic Contour Convolution (DCConv), which employs specially crafted deformable kernels to adaptively model slender facial contours. Second, by leveraging the Mamba architecture, our method captures long-range spatial relationships with linear complexity. This efficiency allows for the extraction of fine-grained, globally distributed forgery artifacts from small image patches. Extensive experiments show that WMamba achieves state-of-the-art (SOTA) performance, highlighting its effectiveness in face forgery detection.