Title :
Robust speech recognition using noise suppression based on multiple composite models and multi-pass search
Author :
Jitsuhiro, Takatoshi ; Toriyama, Tomoji ; Kogure, Kiyoshi
Author_Institution :
ATR Knowledge Sci. Lab., Kyoto
Abstract :
This paper presents robust speech recognition using a noise suppression method based on multi-model compositions and multi-pass search. In real environments, many kinds of noise signals exists, and input speech for speech recognition systems include them. Our task in the E-Nightingale project is speech recognition of voice memoranda spoken by nurses during actual work at hospitals. To obtain good recognized candidates, suppressing many kinds of noise signals at once to find target speech is important. First, before noise suppression, to find speech and noise label sequences, we introduce multi-pass search with acoustic models including many kinds of noise models and their compositions, their n-gram models, and their lexicon. Second, noise suppression based on models is performed using the multiple composite models selected by recognized label sequences with time alignments. We evaluated this approach using the E-Nightingale task, and the proposed method outperformed the conventional method.
Keywords :
Gaussian processes; interference suppression; mean square error methods; search problems; speech recognition; E-Nightingale project; GMM-based MMSE method; Gaussian mixture model; acoustic models; multipass search; multiple composite models; noise suppression; speech recognition; voice memoranda; Acoustic noise; Laboratories; Medical services; Microphones; Noise robustness; Signal detection; Speech analysis; Speech enhancement; Speech recognition; Working environment noise; E-Nightingale project; model composition; multi-pass search; noise suppression; speech recognition;
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
DOI :
10.1109/ASRU.2007.4430083