Speaker Privacy and Security in the Big Data Era: Protection and Defense against Deepfake
Authors: Liping Chen, Kong Aik Lee, Zhen-Hua Ling, Xin Wang, Rohan Kumar Das, Tomoki Toda, Haizhou Li
Published: 2025-09-08 06:22:36+00:00
AI Summary
This paper provides a concise overview of three techniques for addressing security threats from deepfake speech: voice anonymization, deepfake detection, and watermarking. It describes their methodologies, advancements, and challenges, highlighting the need for further research into integrating these techniques.
Abstract
In the era of big data, remarkable advancements have been achieved in personalized speech generation techniques that utilize speaker attributes, including voice and speaking style, to generate deepfake speech. This has also amplified global security risks from deepfake speech misuse, resulting in considerable societal costs worldwide. To address the security threats posed by deepfake speech, techniques have been developed focusing on both the protection of voice attributes and the defense against deepfake speech. Among them, the voice anonymization technique has been developed to protect voice attributes from extraction for deepfake generation, while deepfake detection and watermarking have been utilized to defend against the misuse of deepfake speech. This paper provides a short and concise overview of the three techniques, describing the methodologies, advancements, and challenges. A comprehensive version, offering additional discussions, will be published in the near future.