Regional Information Center for Science and Technology (RICeST) - Noise and room acoustics distorted speech recognition by HMM composition

DocumentCode :

3347450

Title :

Noise and room acoustics distorted speech recognition by HMM composition

Author :

Nakamura, Satoshi ; Takiguchi, Tetsuya ; Shikano, Kiyohiro

Author_Institution :

Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Japan

Volume :

fYear :

1996

fDate :

7-10 May 1996

Firstpage :

Abstract :

This paper presents a robust speech recognition method based on HMM composition for speech distorted by both noise and room acoustics. The method enables an improved user interface in which the user is not encumbered by microphone equipment. The proposed HMM composition is obtained by naturally extending the HMM composition method for additive noise to convolutional room acoustics distortion. The composition is carried out in two steps: (1) composition of the HMMs of speech and of the acoustical transfer function in the cepstrum domain, and (2) composition of the distorted speech and noise HMMs in the linear spectral domain. Speaker-dependent and speaker-independent word recognition experiments are carried out using a speech database contaminated by additive noise and convolutional room acoustics distortion. Evaluation experiments are also conducted for unknown testing sound source positions. The results demonstrate the effectiveness of the proposed method.
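
The two composition steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it combines only the mean vectors of a single Gaussian per model, and it ignores variances, mixture weights, state topology, and any gain adjustment between speech and noise. It also assumes full-dimension cepstra obtained with an orthonormal DCT of the log spectrum, so that the transform can be inverted exactly. The names `dct_matrix`, `matvec`, and `compose_means` are hypothetical and chosen for illustration only.

```python
import math


def dct_matrix(n):
    """Orthonormal DCT-II matrix (n x n); its transpose is its inverse."""
    rows = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
                     for i in range(n)])
    return rows


def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def compose_means(speech_cep, transfer_cep, noise_cep):
    """Combine cepstral mean vectors in two steps (means only, assumed model).

    Step 1: convolution with the room transfer function is additive in the
            cepstrum domain, so the two cepstral means are summed.
    Step 2: additive noise is additive in the linear spectral domain, so both
            means are mapped cepstrum -> log spectrum -> linear spectrum,
            summed there, and mapped back to the cepstrum domain.
    """
    n = len(speech_cep)
    c = dct_matrix(n)
    c_inv = [list(col) for col in zip(*c)]  # transpose = inverse (orthonormal)

    # Step 1: speech (+) acoustical transfer function, cepstrum domain.
    distorted_cep = [s + h for s, h in zip(speech_cep, transfer_cep)]

    # Step 2: distorted speech (+) noise, linear spectral domain.
    distorted_lin = [math.exp(x) for x in matvec(c_inv, distorted_cep)]
    noise_lin = [math.exp(x) for x in matvec(c_inv, noise_cep)]
    noisy_lin = [d + v for d, v in zip(distorted_lin, noise_lin)]

    return matvec(c, [math.log(x) for x in noisy_lin])
```

With a noise mean whose spectrum is negligible, the result reduces to the step-1 sum of the speech and transfer-function cepstra, which is a convenient sanity check for the sketch.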

Keywords :

acoustic noise; architectural acoustics; cepstral analysis; convolution; hidden Markov models; speech processing; speech recognition; transfer functions; HMM composition; acoustical transfer function; additive noise; cepstrum domain; convolutional room acoustics distortion; distorted speech HMM; distorted speech recognition; linear spectral domain; noise HMM; noisy room acoustics distorted speech; robust speech recognition method; speaker dependent word recognition experiments; speaker independent word recognition experiments; speech database; speech transfer function; testing sound source positions; Acoustic distortion; Acoustic noise; Additive noise; Hidden Markov models; Microphones; Noise robustness; Speech enhancement; Speech recognition; Transfer functions; User interfaces;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on

Conference_Location :

Atlanta, GA

ISSN :

1520-6149

Print_ISBN :

0-7803-3192-3

Type :

conf

DOI :

10.1109/ICASSP.1996.540292

Filename :

540292

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3347450