مرکز منطقه ای اطلاع رساني علوم و فناوري - Random discriminant structure analysis for automatic recognition of connected vowels

DocumentCode :

2770199

Title :

Random discriminant structure analysis for automatic recognition of connected vowels

Author :

Qiao, Yu ; Asakawa, Satoshi ; Minematsu, Nobuaki

Author_Institution :

Univ. of Tokyo, Tokyo

fYear :

2007

fDate :

9-13 Dec. 2007

Firstpage :

576

Lastpage :

581

Abstract :

The universal structure of speech [1, 2], proves to be invariant to transformations in feature space, and thus provides a robust representation for speech recognition. One of the difficulties of using structure representation is due to its high dimensionality. This not only increases computational cost but also easily suffers from the curse of dimensionality [3, 4]. In this paper, we introduce random discriminant structure analysis (RDSA) to deal with this problem. Based on the observation that structural features are highly correlated and include large redundancy, the RDSA combines random feature selection and discriminative analysis to calculate several low dimensional and discriminative representations from an input structure. Then an individual classifier is trained for each representation and the outputs of each classifier are integrated for the final classification decision. Experimental results on connected Japanese vowel utterances show that our approach achieves a recognition rate of 98.3% based on the training data of 8 speakers, which is higher than that (97.4%) of HMMs trained with the utterances of 4,130 speakers.

Keywords :

feature extraction; random processes; signal classification; signal representation; speech recognition; automatic speech recognition; connected vowels representation; random discriminant structure analysis; random feature selection; speech classification; Automatic speech recognition; Communication channels; Computational efficiency; Hidden Markov models; Redundancy; Robustness; Speech analysis; Speech recognition; Statistical analysis; Training data; Classifier ensemble; Discriminative analysis; Feature selection; Invariant structure; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on

Conference_Location :

Kyoto

Print_ISBN :

978-1-4244-1746-9

Electronic_ISBN :

978-1-4244-1746-9

Type :

conf

DOI :

10.1109/ASRU.2007.4430176

Filename :

4430176

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2770199