DocumentCode :
3347450
Title :
Noise and room acoustics distorted speech recognition by HMM composition
Author :
Nakamura, Satoshi ; Takiguchi, Tetsuya ; Shikano, Kiyohiro
Author_Institution :
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
69
Abstract :
This paper presents a robust speech recognition method, based on HMM composition, for speech corrupted by both noise and room acoustics. The method enables an improved user interface in which the user is not encumbered by microphone equipment. The proposed HMM composition naturally extends the HMM composition method for additive noise to convolutional room acoustics distortion. The composition is carried out in two steps: (1) composition of the speech HMM and the acoustical transfer function HMM in the cepstrum domain, and (2) composition of the distorted speech HMM and the noise HMM in the linear spectral domain. Speaker-dependent and speaker-independent word recognition experiments are carried out using a speech database contaminated by additive noise and convolutional room acoustics distortion. Evaluation experiments are also conducted for unknown testing sound source positions. The results demonstrate the effectiveness of the proposed method.
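The two composition steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it combines only a single mean vector per model, ignores variances and HMM state topology, and uses the simple "log-add" approximation for the domain conversions. The function names (`dct_matrix`, `compose_means`) and the choice of an orthonormal DCT-II to map between log spectrum and cepstrum are assumptions made for illustration.

```python
import math

def dct_matrix(n):
    """Orthonormal DCT-II matrix C (n x n), so that cepstrum = C @ log_spectrum."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * k * (2 * j + 1) / (2 * n))
                     for j in range(n)])
    return rows

def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]

def transpose(m):
    return [list(col) for col in zip(*m)]

def compose_means(speech_cep, transfer_cep, noise_cep):
    """Hypothetical mean-only composition of speech, transfer function, and noise.

    All three inputs are cepstral mean vectors of the same length.
    """
    n = len(speech_cep)
    C = dct_matrix(n)
    Ct = transpose(C)  # inverse transform, since C is orthonormal

    # Step 1: convolutional distortion is additive in the cepstrum domain.
    distorted_cep = [s + h for s, h in zip(speech_cep, transfer_cep)]

    # Step 2: additive noise is additive in the linear spectral domain,
    # so map cepstrum -> log spectrum -> linear spectrum, add, and map back.
    distorted_lin = [math.exp(x) for x in matvec(Ct, distorted_cep)]
    noise_lin = [math.exp(x) for x in matvec(Ct, noise_cep)]
    noisy_lin = [d + v for d, v in zip(distorted_lin, noise_lin)]
    return matvec(C, [math.log(x) for x in noisy_lin])

# Usage: a flat (all-zero) transfer function and noise equal to the speech
# model doubles the linear spectrum in every bin.
speech = [1.0, 0.5, -0.2, 0.1]
composed = compose_means(speech, [0.0] * 4, speech)
```

In this usage example only the zeroth cepstral coefficient changes, because doubling every spectral bin adds a constant to the log spectrum. A fuller treatment would also propagate the variances through each domain change.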
Keywords :
acoustic noise; architectural acoustics; cepstral analysis; convolution; hidden Markov models; speech processing; speech recognition; transfer functions; HMM composition; acoustical transfer function; additive noise; cepstrum domain; convolutional room acoustics distortion; distorted speech HMM; distorted speech recognition; linear spectral domain; noise HMM; noisy room acoustics distorted speech; robust speech recognition method; speaker dependent word recognition experiments; speaker independent word recognition experiments; speech database; speech transfer function; testing sound source positions; Acoustic distortion; Acoustic noise; Additive noise; Hidden Markov models; Microphones; Noise robustness; Speech enhancement; Speech recognition; Transfer functions; User interfaces;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.540292
Filename :
540292