DocumentCode :
417107
Title :
Disentangling speaker and channel effects in speaker verification
Author :
Kenny, Patrick ; Dumouchel, Pierre
Author_Institution :
Centre de Recherche Informatique de Montreal, Que., Canada
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We show how a joint factor analysis of inter-speaker and intra-speaker variability in a training database which contains multiple recordings for each speaker can be used to construct likelihood ratio statistics for speaker verification which take account of intra-speaker variation and channel variation in a principled way. We report the results of experiments on the NIST 2001 cellular one speaker detection task carried out by applying this type of factor analysis to Switchboard Cellular Part I. The evaluation data for this task is contained in Switchboard Cellular Part I so these results cannot be taken at face value but they indicate that the factor analysis model can perform extremely well if it is perfectly estimated.
Keywords :
maximum likelihood estimation; speaker recognition; NIST 2001 cellular one speaker detection task; Switchboard Cellular Part I; channel variation; inter-speaker variability; intra-speaker variability; joint factor analysis; likelihood ratio statistics; multiple recordings; speaker verification; training database; Adaptation model; Analysis of variance; Data mining; Databases; Face detection; Maximum likelihood linear regression; NIST; Performance analysis; Speaker recognition; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1325916
Filename :
1325916
Link To Document :
بازگشت