DocumentCode
1063179
Title
Approaches to Iterative Speech Feature Enhancement and Recognition
Author
Windmann, Stefan ; Haeb-Umbach, Reinhold
Author_Institution
Dept. of Commun. Eng., Univ. of Paderborn, Paderborn
Volume
17
Issue
5
fYear
2009
fDate
7/1/2009 12:00:00 AM
Firstpage
974
Lastpage
984
Abstract
In automatic speech recognition, hidden Markov models (HMMs) are commonly used for speech decoding, while switching linear dynamic models (SLDMs) can be employed for a preceding model-based speech feature enhancement stage. In this paper, these model types are combined to obtain a novel iterative speech feature enhancement and recognition architecture. It is shown that speech feature enhancement with SLDMs can be improved by feeding back information from the HMM to the enhancement stage. Two different feedback structures are derived. In the first, the posteriors of the HMM states are used to control the model probabilities of the SLDMs, while in the second they are employed to directly influence the estimate of the speech feature distribution. Both approaches improve recognition accuracy on the AURORA2 and AURORA4 databases compared to non-iterative speech feature enhancement with SLDMs. It is also shown that a combination with uncertainty decoding further enhances performance.
Keywords
hidden Markov models; iterative methods; speech enhancement; speech recognition; AURORA2 databases; AURORA4 databases; HMM; automatic speech recognition; feedback structures; iterative speech feature enhancement; model probabilities; speech decoding; speech feature distribution; switching linear dynamic models; Feedback; Iterative decoding; Spatial databases; State estimation; Uncertainty; Dynamical systems; robust speech recognition
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2009.2014894
Filename
5067416