Title :
Modeling inter-speaker variability in speech recognition
Author :
Cloarec, Gwenael ; Jouvet, Denis
Author_Institution :
Div. R&D, France Telecom, Lannion
fDate :
March 31 2008-April 4 2008
Abstract :
This paper details a method for taking into account variability influence in HMM-based speech recognition. The set of Gaussian components of the mixtures represents the entire acoustic space covered for all possible variability values. For each utterance to be recognized, the corresponding variability value is estimated and used to weight and/or constrain dynamically the acoustic space for each pdf. To do that, the weight coefficients of the Gaussian mixtures are set dependent on the variability value. As an example, the variability considered is the inter-speaker variability, and is handled through speaker classes. Taking into account for each utterance the four speaker classes that best match with the utterance signal leads to a significant word error rate reduction on a continuous speech recognition task, as compared to standard speaker-independent modeling.
Keywords :
Gaussian processes; error statistics; hidden Markov models; speech recognition; Gaussian components; Gaussian mixtures; HMM-based speech recognition; continuous speech recognition task; inter-speaker variability modeling; speaker-independent modeling; utterance signal; word error rate reduction; Acoustic testing; Bayesian methods; Decoding; Error analysis; Hidden Markov models; Loudspeakers; Performance evaluation; Research and development; Speech recognition; Telecommunications; Speech recognition; acoustic modeling; dynamic Bayesian network; inter-speaker variability; speaker class;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4518663