DocumentCode :
3179173
Title :
Exploration of class specific ABWE for robust children´s ASR under mismatched condition
Author :
Sunil, Y. ; Sinha, R.
Author_Institution :
Dept. of Electron. & Electr. Eng., Indian Inst. of Technol. Guwahati, Guwahati, India
fYear :
2012
fDate :
22-25 July 2012
Firstpage :
1
Lastpage :
5
Abstract :
Recently we have explored the use of a Gaussian mixture model (GMM) based global transformer for artificial bandwidth extension (ABWE) for improving the automatic recognition of children´s speech in mismatched condition. As the spectral characteristic of the speech varies significantly from one sound class to another so the global transformation would be sub-optimal for that purpose. Motivated by that in this work, we explore the use of class specific GMM based ABWE transformers for the bandwidth extension of the narrowband speech. For the deriving the class specific ABWE transformers an existing unsupervised hidden Markov model (HMM) based method is used. Further for contrast purpose an supervised class specific GMM based ABWE transformers are also explored. The unsupervised and supervised class specific ABWE approaches have resulted in 21.30% and 26.37% relative improvement in word error rate on digit recognition task. The effectiveness of class specific ABWE is also explored in terms of the mutual information between narrowband and the extended higherband speech as well as a group of other speech quality measures.
Keywords :
Gaussian processes; error statistics; hidden Markov models; speech recognition; ABWE; GMM; Gaussian mixture model; automatic speech recognition; class specific artificial bandwidth extension; digit recognition task; global transformation; mismatched condition; mutual information; narrowband speech; robust children ASR; spectral characteristic; speech quality measures; unsupervised hidden Markov model; word error rate; Hidden Markov models; Mutual information; Narrowband; Niobium; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications (SPCOM), 2012 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4673-2013-9
Type :
conf
DOI :
10.1109/SPCOM.2012.6290226
Filename :
6290226
Link To Document :
بازگشت