EfficientNet-Based Multi-Class Detection of Real, Deepfake, and Plastic Surgery Faces

Authors: Li Kun, Milena Radenkovic

Published: 2025-09-12 17:38:51+00:00

AI Summary

This paper proposes a deep learning-based system for multi-class detection of real, deepfake, and plastic surgery altered faces. It uses EfficientNet-B4, ResNet-50, and VGG-16 models for classification after pre-processing video frames with MTCNN and the Azure Face API for face extraction. The research aims to develop advanced forgery detection tools that are robust against various forms of facial manipulation.

Abstract

Deep learning is now utilised to tackle many difficulties in everyday life. It not only drives progress in computer vision but also underpins several revolutionary technologies. Nonetheless, as with all such advances, the use of deep learning across diverse domains has produced a complex mix of advantages and disadvantages for society. Deepfake technology has advanced rapidly and significantly affects social life; its development can threaten privacy, the reputations of prominent figures, and national security. It can produce counterfeit photographs and videos that are indistinguishable from genuine ones, potentially defeating facial recognition systems and thus presenting a significant risk. The improper application of deepfake technology has several detrimental effects on society. Face-swapping applications mislead users by altering people's appearances or expressions to fulfil particular aims or to steal personal information, and through such techniques deepfake technology permeates daily life. Some individuals attempt to sabotage election campaigns or undermine prominent political figures by creating deceptive images to sway public perception, causing significant harm to a nation's political and economic structure.


Key findings
The EfficientNet-B4 model achieved the highest performance, reaching 97% accuracy after only 5 epochs. The VGG-16 model also reached 97% accuracy but required 100 epochs, while ResNet-50 remained below 94% accuracy even after 100 epochs. Among the evaluated architectures, EfficientNet-B4 is therefore the most efficient and effective model for this multi-class detection task.
Approach
The approach involves pre-processing video frames to detect and extract faces using MTCNN and the Azure Face API. The cropped face images are then fed into deep convolutional neural networks (EfficientNet-B4, ResNet-50, and VGG-16) for multi-class classification, distinguishing between real, deepfake, and plastic surgery altered faces.
Datasets
HDA Plastic Surgery Face Database, DeeperForensics-1.0
Model(s)
EfficientNet-B4, ResNet-50, VGG-16, MTCNN (for feature extraction)
Author countries
UK