• DocumentCode
    3527151
  • Title

    Data sampling based ensemble acoustic modelling

  • Author

    Chen, Xin ; Zhao, Yunxin

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    3805
  • Lastpage
    3808
  • Abstract
    In this paper, we propose a novel technique of using cross validation (CV) data sampling to construct an ensemble of acoustic models for conversational speech recognition. We further propose using hierarchical Gaussian mixture model (HGMM) and repartition training data to increase the ensemble size and diversity. The proposed methods are found to work well together for ensemble acoustic modeling. We also evaluated the quality of the ensemble acoustic models by using the measures of classification margin, average correct score and variance of correct score. We have found that the ensemble of acoustic models increases the margin and the average correct score, and reduces the variance. We compared the performance of our proposed method with a recently reported method of CV expectation maximization (CVEM) for single acoustic models. Our experimental results on a telemedicine automatic captioning task showed that the proposed ensemble acoustic modeling has led to significant improvements in word recognition accuracy.
  • Keywords
    Gaussian processes; expectation-maximisation algorithm; speech recognition; telemedicine; word processing; CV expectation maximization; acoustic modeling; cross validation; data sampling; ensemble acoustic modelling; hierarchical Gaussian mixture model; speech recognition; telemedicine automatic captioning task; word recognition accuracy; Acoustic measurements; Computer science; Context modeling; Decision trees; Decoding; Hidden Markov models; Sampling methods; Speech recognition; Telemedicine; Training data; acoustic modeling; cross validation; data sampling; ensemble classifier; hierarchical mixture ensemble;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960456
  • Filename
    4960456