DocumentCode
1920692
Title
Features Extraction, Modeling and Training Strategies in Continuous Speech Recognition for Romanian Language
Author
Dumitru, Comeliu Octavian ; Gavat, Inge
Author_Institution
Fac. of Electron., Telecommun., & Inf. Technol., Bucharest Politehnica Univ.
Volume
2
fYear
2005
fDate
21-24 Nov. 2005
Firstpage
1425
Lastpage
1428
Abstract
This paper describes continuous speech recognition experiments for Romanian language, by using HMM modeling. The following questions are to be discussed: the realization of a new front-end reconsidering linear prediction, the enhancement of recognition rates by context dependent modeling, the evaluation of training strategies ensuring speaker independence of the recognition process without speaker adaptation procedures, by speaker selection for training. The experiments lead to a development of the initial system with a promising front-end based on PLP coefficients, second ranked for the recognition performance obtained, near the first ranked front-end based on mel-frequency cepstral coefficients (MFCC), but far better as the last ranked, based on simple linear prediction. Concerning the implemented algorithm for context dependent modeling, it permits in all situations enhanced recognition rates. The experiments made with gender speaker selection enhanced under certain conditions the recognition rate, proving good generalization properties especially by training with the male speakers database
Keywords
feature extraction; hidden Markov models; learning (artificial intelligence); natural languages; speech recognition; HMM modeling; PLP coefficients; Romanian language; context dependent modeling; continuous speech; feature extraction; linear prediction; mel-frequency cepstral coefficients; speech recognition; Context modeling; Databases; Educational technology; Feature extraction; Hidden Markov models; Linear predictive coding; Loudspeakers; Mel frequency cepstral coefficient; Natural languages; Speech recognition; HMM; LPC; MFCC; PLP; context dependent modeling; continuous speech;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer as a Tool, 2005. EUROCON 2005.The International Conference on
Conference_Location
Belgrade
Print_ISBN
1-4244-0049-X
Type
conf
DOI
10.1109/EURCON.2005.1630229
Filename
1630229
Link To Document