Title :
Speaker verification performance with constrained durations
Author :
Sordo Martinez, Pablo L. ; Fauve, Benoit ; Larcher, Anthony ; Mason, John S. D.
Author_Institution :
Speech & Image Res. Group, Swansea Univ., Swansea, UK
Abstract :
Over the last decade, speaker recognition has witnessed significant advances, with successful developments in Factor Analysis (FA) and, more recently, i-vectors more than halving the error rates achieved by the classical UBM/GMM approach. However, when very short duration utterances are considered, these improvements are known to be much smaller. This paper begins with a review of recent developments in i-vector systems, with a focus on short test durations in the region of 10 seconds or less. Experimental results are then presented showing that error rates rise from approximately 5% to 18% when the test duration is systematically reduced from 30 seconds to just 3 seconds. Interestingly, in the 30-second condition the i-vector error rate is roughly half that of the corresponding UBM/GMM system. Nevertheless, when the test segments are just 3 seconds long, the error rates of the two systems are very similar. All experiments relate to the short-short condition of the NIST 2008 SRE, but with the test segment duration systematically reduced.
Keywords :
speaker recognition; vectors; UBM/GMM approach; constrained durations; factor analysis; i-vector error rate; i-vector systems; speaker verification performance; Context; Error analysis; NIST; Speech; GMM/UBM; LDA; PLDA; Speaker verification; i-vectors; short duration
Conference_Title :
Biometrics and Forensics (IWBF), 2014 International Workshop on
Conference_Location :
Valletta
DOI :
10.1109/IWBF.2014.6914243