DocumentCode
2772135
Title
Adaptive Parallel Model Combination for reduced environmental mismatch in noisy speech recognition
Author
Tan, S.S. ; Ahmad, Abdul Manan
Author_Institution
Software Eng. Dept., Univ. of Technol. Malaysia, Johor Bahru
fYear
2008
fDate
1-3 Dec. 2008
Firstpage
1
Lastpage
6
Abstract
Due to environmental mismatch, speech recognition systems often exhibit drastic performance degradation in noisy conditions. This paper presents a model based technique termed adaptive parallel model combination (APMC) which compensates the initial acoustic models to reduce the discrepancy. APMC used the well-known PMC technique to composite a set of corrupted speech models, while fine tuning the mean parameter of the models using a transformation-based adaptation technique called Maximum Likelihood Spectral Transformation (MLST). Evaluated on a context-independent phone recognition task, APMC was found to be superior to both PMC and MLST, especially in non-stationary noisy conditions. On average, APMC has achieved 48.81% improvement over the initial models, whereas PMC and MLST have improved the accuracy by 34.12% and 35.23% respectively.
Keywords
hidden Markov models; maximum likelihood estimation; speech recognition; MLST; acoustic models; adaptive PMC technique; context-independent phone recognition task; left-right HMM; maximum likelihood spectral transformation; noisy speech recognition system; parallel model combination; Acoustic noise; Degradation; Hidden Markov models; Information systems; Noise reduction; Noise robustness; Software engineering; Speech enhancement; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Electronic Design, 2008. ICED 2008. International Conference on
Conference_Location
Penang
Print_ISBN
978-1-4244-2315-6
Electronic_ISBN
978-1-4244-2315-6
Type
conf
DOI
10.1109/ICED.2008.4786758
Filename
4786758
Link To Document