• DocumentCode
    2650693
  • Title

    Efficient adaptations of the SphinxTrain procedure for building a robust ASR system in Slovak

  • Author

    Kacur, J. ; Vojtko, J.

  • Author_Institution
    Dept. of Telecommun., Slovak Univ. of Technol., Bratislava
  • fYear
    2008
  • fDate
    25-28 June 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In the following article we discuss the practical and theoretical aspects of the building of Slovak ASR system using the SPHINX system and its SphinxTrain adaptation procedure for CI and CD HMM models. Concerning issues are ranging from the optimal setting of the number of states per model, through the adjustment of the number of tied states for context dependent HMMspsila, number of Gaussian mixtures, and training scenarios regarding the spelled recordings and background models. All experiments and results were obtained for the MOBILDAT-SK speech database that contains 32500 recordings from 1100 speakers. Obtained CI and CD HMM models achieved WER around 5% for application words which qualifies them for a use in practical applications. Furthermore the suggested and realized modifications to the classical SphinxTrain procedure for a given database and the Slovak language brought improved overall results as well.
  • Keywords
    Gaussian processes; hidden Markov models; natural language processing; speech recognition; Gaussian mixtures; HMM model; SPHINX system; Slovak ASR system; SphinxTrain procedure; context dependent HMM; robust ASR system; Automatic speech recognition; CD recording; Context modeling; Databases; Decision trees; Hidden Markov models; Information technology; Robustness; Speech processing; Speech recognition; Automatic speech recognition; HMM; MobilDat; SPHINX; SphinxTrain;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Systems, Signals and Image Processing, 2008. IWSSIP 2008. 15th International Conference on
  • Conference_Location
    Bratislava
  • Print_ISBN
    978-80-227-2856-0
  • Electronic_ISBN
    978-80-227-2880-5
  • Type

    conf

  • DOI
    10.1109/IWSSIP.2008.4604352
  • Filename
    4604352