DocumentCode :
2770199
Title :
Random discriminant structure analysis for automatic recognition of connected vowels
Author :
Qiao, Yu ; Asakawa, Satoshi ; Minematsu, Nobuaki
Author_Institution :
Univ. of Tokyo, Tokyo
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
576
Lastpage :
581
Abstract :
The universal structure of speech [1, 2], proves to be invariant to transformations in feature space, and thus provides a robust representation for speech recognition. One of the difficulties of using structure representation is due to its high dimensionality. This not only increases computational cost but also easily suffers from the curse of dimensionality [3, 4]. In this paper, we introduce random discriminant structure analysis (RDSA) to deal with this problem. Based on the observation that structural features are highly correlated and include large redundancy, the RDSA combines random feature selection and discriminative analysis to calculate several low dimensional and discriminative representations from an input structure. Then an individual classifier is trained for each representation and the outputs of each classifier are integrated for the final classification decision. Experimental results on connected Japanese vowel utterances show that our approach achieves a recognition rate of 98.3% based on the training data of 8 speakers, which is higher than that (97.4%) of HMMs trained with the utterances of 4,130 speakers.
Keywords :
feature extraction; random processes; signal classification; signal representation; speech recognition; automatic speech recognition; connected vowels representation; random discriminant structure analysis; random feature selection; speech classification; Automatic speech recognition; Communication channels; Computational efficiency; Hidden Markov models; Redundancy; Robustness; Speech analysis; Speech recognition; Statistical analysis; Training data; Classifier ensemble; Discriminative analysis; Feature selection; Invariant structure; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430176
Filename :
4430176
Link To Document :
بازگشت