DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake Detection

Authors: Yewon Lim, Changyeon Lee, Aerin Kim, Oren Etzioni

Published: 2024-06-02 20:22:38+00:00

Comment: 6 pages, 1 figure

AI Summary

This paper introduces DistilDIRE, a novel approach for rapid deepfake detection of diffusion-generated images. It addresses the significant computational load of existing methods like DIRE by distilling knowledge from pre-trained diffusion models into a lightweight detection model. DistilDIRE achieves a substantial reduction in operational demands and inference time while maintaining robust detection performance.

Abstract

A dramatic influx of diffusion-generated images has marked recent years, posing unique challenges to current detection technologies. While the task of identifying these images falls under binary classification, a seemingly straightforward category, the computational load is significant when employing the reconstruction then compare technique. This approach, known as DIRE (Diffusion Reconstruction Error), not only identifies diffusion-generated images but also detects those produced by GANs, highlighting the technique's broad applicability. To address the computational challenges and improve efficiency, we propose distilling the knowledge embedded in diffusion models to develop rapid deepfake detection models. Our approach, aimed at creating a small, fast, cheap, and lightweight diffusion synthesized deepfake detector, maintains robust performance while significantly reducing operational demands. Maintaining performance, our experimental results indicate an inference speed 3.2 times faster than the existing DIRE framework. This advance not only enhances the practicality of deploying these systems in real-world settings but also paves the way for future research endeavors that seek to leverage diffusion model knowledge.


Key findings
DistilDIRE achieves an inference speed 3.2 times faster than the existing DIRE framework, with approximately 97% reduction in computational demand (FLOPS) compared to DIRE. It maintains high accuracy and average precision (e.g., 99.0/99.7% against SD-v1 on ImageNet and 100/100% against SD-v2 on CelebA-HQ) across various diffusion and GAN-generated images. The inclusion of knowledge distillation loss significantly enhances both accuracy and generalization capabilities.
Approach
DistilDIRE leverages a knowledge distillation framework where a pre-trained ResNet-50 (teacher) guides a student model. The student model is trained with classification loss and a knowledge distillation loss on feature maps, using a concatenated input of the original image and the first-step predicted noise from an Ablated Diffusion Model (ADM). This method enables efficient deepfake detection without computationally intensive reconstruction.
Datasets
DiffusionForensics dataset, ImageNet subset, CelebA-HQ subset
Model(s)
ResNet-50 (as teacher and student models), Ablated Diffusion Model (ADM) for noise extraction
Author countries
Republic of Korea, United States