DocumentCode :
2943146
Title :
Structure optimization of metamodels to improve speech recognition accuracy
Author :
Morales, Santiago Omar Caballero
Author_Institution :
Postgrad. Div., Technol. Univ. of the Mixtec Region, Huajuapan de León, Mexico
fYear :
2011
fDate :
Feb. 28 2011-March 2 2011
Firstpage :
125
Lastpage :
130
Abstract :
The metamodels is a technique that was developed to model a speaker´s phoneme confusion-matrix and use this information to increase speech recognition accuracy for speakers with disordered and normal speech. Approaches to improve the performance of the metamodels, mainly focused on obtaining better estimates of the speaker´s confusion-matrix, were studied. While some achieved significant improvements, alternatives to the functional structure of the metamodels were not explored. In this paper is proposed a different structure for the metamodel of a phoneme and its optimization by means of a genetic algorithm. Results showed statistically significant gains in speech recognition accuracy over the previous metamodels.
Keywords :
genetic algorithms; speech recognition; genetic algorithm; metamodels; speaker phoneme confusion-matrix; speech recognition; structure optimization; Accuracy; Context; Gallium; Hidden Markov models; Optimization; Speech recognition; Training;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical Communications and Computers (CONIELECOMP), 2011 21st International Conference on
Conference_Location :
San Andres Cholula
Print_ISBN :
978-1-4244-9558-0
Type :
conf
DOI :
10.1109/CONIELECOMP.2011.5749348
Filename :
5749348
Link To Document :
بازگشت