• DocumentCode
    1920692
  • Title

    Features Extraction, Modeling and Training Strategies in Continuous Speech Recognition for Romanian Language

  • Author

    Dumitru, Comeliu Octavian ; Gavat, Inge

  • Author_Institution
    Fac. of Electron., Telecommun., & Inf. Technol., Bucharest Politehnica Univ.
  • Volume
    2
  • fYear
    2005
  • fDate
    21-24 Nov. 2005
  • Firstpage
    1425
  • Lastpage
    1428
  • Abstract
    This paper describes continuous speech recognition experiments for Romanian language, by using HMM modeling. The following questions are to be discussed: the realization of a new front-end reconsidering linear prediction, the enhancement of recognition rates by context dependent modeling, the evaluation of training strategies ensuring speaker independence of the recognition process without speaker adaptation procedures, by speaker selection for training. The experiments lead to a development of the initial system with a promising front-end based on PLP coefficients, second ranked for the recognition performance obtained, near the first ranked front-end based on mel-frequency cepstral coefficients (MFCC), but far better as the last ranked, based on simple linear prediction. Concerning the implemented algorithm for context dependent modeling, it permits in all situations enhanced recognition rates. The experiments made with gender speaker selection enhanced under certain conditions the recognition rate, proving good generalization properties especially by training with the male speakers database
  • Keywords
    feature extraction; hidden Markov models; learning (artificial intelligence); natural languages; speech recognition; HMM modeling; PLP coefficients; Romanian language; context dependent modeling; continuous speech; feature extraction; linear prediction; mel-frequency cepstral coefficients; speech recognition; Context modeling; Databases; Educational technology; Feature extraction; Hidden Markov models; Linear predictive coding; Loudspeakers; Mel frequency cepstral coefficient; Natural languages; Speech recognition; HMM; LPC; MFCC; PLP; context dependent modeling; continuous speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer as a Tool, 2005. EUROCON 2005.The International Conference on
  • Conference_Location
    Belgrade
  • Print_ISBN
    1-4244-0049-X
  • Type

    conf

  • DOI
    10.1109/EURCON.2005.1630229
  • Filename
    1630229