مرکز منطقه ای اطلاع رساني علوم و فناوري - Multi-stream parameterization for structural speech recognition

DocumentCode :

3422471

Title :

Multi-stream parameterization for structural speech recognition

Author :

Asakawa, Satoshi ; Minematsu, Nobuaki ; Hirose, Keikichi

Author_Institution :

Grad. Sch. of Frontier Sci., Univ. of Tokyo, Tokyo

fYear :

2008

fDate :

March 31 2008-April 4 2008

Firstpage :

4097

Lastpage :

4100

Abstract :

Recently, a novel and structural representation of speech was proposed [1,2], where the inevitable acoustic variations caused by non- linguistic factors are effectively removed from speech. This structural representation captures only microphone- and speaker-invariant speech contrasts or dynamics and uses no absolute or static acoustic properties directly such as spectrums. In our previous study, the new representation was applied to recognizing a sequence of isolated vowels [3]. The structural models trained with a single speaker outperformed the conventional HMMs trained with more than four thousand speakers even in the case of noisy speech. We also applied the new models to recognizing utterances of connected vowels [4]. In the current paper, a multiple stream structuralization method is proposed to improve the performance of the structural recognition framework. The proposed method only with 8 training speakers shows the very comparable performance to that of the conventional 4,130-speaker triphone-based HMMs.

Keywords :

hidden Markov models; speech recognition; HMM; multistream parameterization; speech representation; structural speech recognition; Acoustic distortion; Acoustic noise; Additive noise; Auditory system; Hidden Markov models; Humans; Loudspeakers; Microphones; Robustness; Speech recognition; multiple stream structuralization; robust invariance; speech recognition; the structural representation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on

Conference_Location :

Las Vegas, NV

ISSN :

1520-6149

Print_ISBN :

978-1-4244-1483-3

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2008.4518555

Filename :

4518555

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3422471