DocumentCode
1122198
Title
On the use of different speech representations for speaker modeling
Author
Chen, Ke
Author_Institution
Sch. of Informatics, Univ. of Manchester, UK
Volume
35
Issue
3
fYear
2005
Firstpage
301
Lastpage
314
Abstract
Numerous speech representations have been reported to be useful in speaker recognition. However, there is much less agreement on which speech representation provides a perfect representation of speaker-specific information conveyed in a speech signal. Unlike previous work, we propose an alternative approach to speaker modeling by the simultaneous use of different speech representations in an optimal way. Inspired by our previous empirical studies, we present a soft competition scheme on different speech representations to exploit different speech representations in encoding speaker-specific information. On the basis of this soft competition scheme, we present a parametric statistical model, generalized Gaussian mixture model (GGMM), to characterize a speaker identity based on different speech representations. Moreover, we develop an expectation-maximization algorithm for parameter estimation in the GGMM. The proposed speaker modeling approach has been applied to text-independent speaker recognition and comparative results on the KING speech corpus demonstrate its effectiveness.
Keywords
Gaussian processes; parameter estimation; speaker recognition; statistical analysis; KING speech corpus; expectation-maximization algorithm; generalized Gaussian mixture model; parameter estimation; parametric statistical model; soft competition scheme; speaker modeling; speaker-specific information encoding; speech representations; speech signal; text-independent speaker recognition; Cepstral analysis; Data mining; Encoding; Feature extraction; Pattern recognition; Signal processing; Speaker recognition; Speech analysis; Speech processing; Speech recognition; Different speech representations; KING speech corpus; expectation-maximazation (EM) algorithm; generalized Gaussian mixture model (GGMM); soft competition; speaker modeling; speaker recognition; speaker-specific information;
fLanguage
English
Journal_Title
Systems, Man, and Cybernetics, Part C: Applications and Reviews, IEEE Transactions on
Publisher
ieee
ISSN
1094-6977
Type
jour
DOI
10.1109/TSMCC.2005.848166
Filename
1487579
Link To Document