Title :
Exploiting magnitude and phase spectral information for converted speech detection
Author :
Correia, Maria Joana ; Abad, Alberto ; Trancoso, Isabel
Author_Institution :
INESC-ID/Spoken Language Syst. Lab., Lisbon, Portugal
Abstract :
Speaker verification systems have been shown to be vulnerable in situations where voice conversion techniques are used to try to fool them, evidencing an important security breach in these applications. This work focuses on the development of a new converted speech detector able to robustly address this problem. The proposed detector uses four spectral features extracted from the magnitude and the phase spectrum of the speech signal. To evaluate the performance of the detector we use a subset of the core task of the NIST SRE2006 corpus as the natural data. The converted data was produced with two different voice conversion methods: Gaussian mixture model and unit selection, from other NIST SRE2006 conditions. The converted speech detector achieved a detection accuracy of 99.1% and 98.5% for natural and converted utterances, respectively.
Keywords :
Gaussian processes; feature extraction; mixture models; speaker recognition; Gaussian mixture model; NIST SRE2006 conditions; NIST SRE2006 corpus; converted speech detection; feature extraction; magnitude information; phase spectral information; phase spectrum; security breach; speaker verification systems; spectral features; speech signal; unit selection; voice conversion techniques; Accuracy; Detectors; Feature extraction; NIST; Speech; Support vector machines; Vectors; Converted speech detection; Speaker verification; Spoofing; Voice conversion;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078607