• DocumentCode
    2791439
  • Title

    Investigations on ensemble based unsupervised adaptation methods

  • Author

    Kubota, Yu ; Shinozaki, Takahiro ; Furui, Sadaoki

  • Author_Institution
    Grad. Sch. of Inf. Sci. & Eng., Tokyo Inst. of Technol., Tokyo, Japan
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4874
  • Lastpage
    4877
  • Abstract
    We have previously proposed unsupervised cross-validation (CV) adaptation that introduces CV into an iterative unsupervised batch mode adaptation framework to suppress the influence of errors in an internally generated recognition hypothesis and have shown that it improves recognition performance. However, a limitation was that the experiments were performed using only a clean speech recognition task with a ML trained initial acoustic model. Another limitation was that only the CV method was investigated while there was a possibility of using other ensemble methods. In this study, we evaluate the CV method using a discriminatively trained baseline and a noisy speech recognition task. As an alternative to CV adaptation, unsupervised aggregated (Ag) adaptation is proposed and investigated that introduces a bagging like idea instead of CV. Experimental results show that CV and Ag adaptations consistently give larger improvements than the conventional batch adaptation but the former is more advantageous in terms of computational cost.
  • Keywords
    iterative methods; speech recognition; unsupervised learning; ML trained initial acoustic model; clean speech recognition task; internally generated recognition hypothesis; iterative unsupervised batch mode adaptation framework; noisy speech recognition task; unsupervised aggregated adaptation; unsupervised cross validation; Acoustic noise; Bagging; Computational efficiency; Computer errors; Information science; Iterative decoding; Iterative methods; Maximum likelihood decoding; Parameter estimation; Speech recognition; Cross-validation; acoustic model; bagging; machine learning ensemble; unsupervised adaptation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495118
  • Filename
    5495118