Towards the Detection of Diffusion Model Deepfakes

View on arXiv ← Back to list

Authors: Jonas Ricker, Simon Damm, Thorsten Holz, Asja Fischer

Published: 2022-10-26 09:01:19+00:00

AI Summary

This paper investigates the detection of images generated by diffusion models (DMs), a relatively unexplored area in deepfake detection. The authors find that state-of-the-art GAN detectors fail on DM-generated images, but retraining these detectors on DM data achieves near-perfect detection, even generalizing to GANs.

Abstract

In the course of the past few years, diffusion models (DMs) have reached an unprecedented level of visual quality. However, relatively little attention has been paid to the detection of DM-generated images, which is critical to prevent adverse impacts on our society. In contrast, generative adversarial networks (GANs), have been extensively studied from a forensic perspective. In this work, we therefore take the natural next step to evaluate whether previous methods can be used to detect images generated by DMs. Our experiments yield two key findings: (1) state-of-the-art GAN detectors are unable to reliably distinguish real from DM-generated images, but (2) re-training them on DM-generated images allows for almost perfect detection, which remarkably even generalizes to GANs. Together with a feature space analysis, our results lead to the hypothesis that DMs produce fewer detectable artifacts and are thus more difficult to detect compared to GANs. One possible reason for this is the absence of grid-like frequency artifacts in DM-generated images, which are a known weakness of GANs. However, we make the interesting observation that diffusion models tend to underestimate high frequencies, which we attribute to the learning objective.

Key findings

State-of-the-art GAN detectors perform poorly on DM-generated images. Retraining these detectors on DM data drastically improves detection accuracy and surprisingly generalizes to GAN detection. Diffusion models produce fewer detectable artifacts than GANs, particularly in the frequency domain.

Approach

The researchers evaluated the performance of existing GAN image detectors on DM-generated images. They found that retraining these detectors specifically on DM-generated images significantly improved their ability to distinguish between real and fake images, even generalizing to GANs.

Datasets

LSUN Bedroom (256x256), LSUN Church, ImageNet, FFHQ, additional datasets including variations of ADM, and popular text-to-image models like Stable Diffusion and Midjourney.

Model(s)

ResNet-50, EfficientNet-B4 (used in Mandelli2022). Wang2020 and Gragnaniello2021 architectures were also used and retrained.

Author countries

Germany

← Previous