DocumentCode :
3111913
Title :
Soft-weighting technique for robust children speech recognition under mismatched condition
Author :
Kathania, Hemant Kumar ; Ghai, Sunil ; Sinha, Roopak
Author_Institution :
Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
fYear :
2013
fDate :
13-15 Dec. 2013
Firstpage :
1
Lastpage :
6
Abstract :
The children´s speech recognition performance under mismatched condition i.e., recognizing on the adults´ speech trained models is a challenging task. It is well known that MFCC features contain all the information regarding speech and mismatch factor at the same time. Therefore, in this work, the truncation of MFCC features is explored for children´s speech recognition on adults´ speech trained models. It has already been noted that cepstral truncation gives improved result but excessive increase in cepstral truncation will cause the loss of the relevant spectral information. Motivated by this, in this work, we explored soft-weighing technique that is heteroscedastic linear discriminant analysis (HLDA) for optimizing losses of cepstral information during truncation. In this paper, an HLDA transformation based technique to reduce mismatch condition is proposed. Further, we have tried to develop a linear relationship between HLDA transformation subspace of MFCC features and the VTLN warp factor values. Finally, a scheme to concatenate VTLN and CMLLR is explored. The proposed approach was found to give improvement in performance by 42.18% and 18.88% in the case of connected digit recognition and continuous speech recognition respectively in comparison to direct cepstral truncation.
Keywords :
cepstral analysis; speech recognition; HLDA transformation based technique; HLDA transformation subspace; MFCC features; VTLN warp factor values; adult speech trained models; cepstral information; children speech recognition performance; connected digit recognition; continuous speech recognition; direct cepstral truncation; heteroscedastic linear discriminant analysis; linear relationship; mismatch condition; mismatched condition; robust children speech recognition; soft-weighting technique; spectral information; Covariance matrices; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Speech recognition; Automatic speech recognition; acoustic mismatch; cepstral truncation; children´s speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2013 Annual IEEE
Conference_Location :
Mumbai
Print_ISBN :
978-1-4799-2274-1
Type :
conf
DOI :
10.1109/INDCON.2013.6726063
Filename :
6726063
Link To Document :
بازگشت