ASASVIcomtech: The Vicomtech-UGR Speech Deepfake Detection and SASV Systems for the ASVspoof5 Challenge
Authors: Juan M. Martín-Doñas, Eros Roselló, Angel M. Gomez, Aitor Álvarez, Iván López-Espejo, Antonio M. Peinado
Published: 2024-08-19 18:57:34+00:00
Comment: This paper was accepted at ASVspoof Workshop 2024
AI Summary
This paper details the ASASVIcomtech team's participation in the ASVspoof5 Challenge, addressing both speech deepfake detection (Track 1) and spoofing-aware speaker verification (Track 2). While a closed-condition system for Track 1 yielded unsatisfactory results, the team achieved very competitive performance in open-condition settings for both tracks through an ensemble system leveraging self-supervised models and augmented training data.
Abstract
This paper presents the work carried out by the ASASVIcomtech team, made up of researchers from Vicomtech and University of Granada, for the ASVspoof5 Challenge. The team has participated in both Track 1 (speech deepfake detection) and Track 2 (spoofing-aware speaker verification). This work started with an analysis of the challenge available data, which was regarded as an essential step to avoid later potential biases of the trained models, and whose main conclusions are presented here. With respect to the proposed approaches, a closed-condition system employing a deep complex convolutional recurrent architecture was developed for Track 1, although, unfortunately, no noteworthy results were achieved. On the other hand, different possibilities of open-condition systems, based on leveraging self-supervised models, augmented training data from previous challenges, and novel vocoders, were explored for both tracks, finally achieving very competitive results with an ensemble system.