DocumentCode
417107
Title
Disentangling speaker and channel effects in speaker verification
Author
Kenny, Patrick ; Dumouchel, Pierre
Author_Institution
Centre de Recherche Informatique de Montreal, Que., Canada
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
We show how a joint factor analysis of inter-speaker and intra-speaker variability in a training database which contains multiple recordings for each speaker can be used to construct likelihood ratio statistics for speaker verification which take account of intra-speaker variation and channel variation in a principled way. We report the results of experiments on the NIST 2001 cellular one speaker detection task carried out by applying this type of factor analysis to Switchboard Cellular Part I. The evaluation data for this task is contained in Switchboard Cellular Part I so these results cannot be taken at face value but they indicate that the factor analysis model can perform extremely well if it is perfectly estimated.
Keywords
maximum likelihood estimation; speaker recognition; NIST 2001 cellular one speaker detection task; Switchboard Cellular Part I; channel variation; inter-speaker variability; intra-speaker variability; joint factor analysis; likelihood ratio statistics; multiple recordings; speaker verification; training database; Adaptation model; Analysis of variance; Data mining; Databases; Face detection; Maximum likelihood linear regression; NIST; Performance analysis; Speaker recognition; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325916
Filename
1325916
Link To Document