• DocumentCode
    730728
  • Title

    Switching to and combining offline-adapted cluster acoustic models based on unsupervised segment classification

  • Author

    Jintao Jiang ; Sawaf, Hassan

  • Author_Institution
    Applic. Technol., McLean, VA, USA
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    4649
  • Lastpage
    4653
  • Abstract
    The performance of automatic speech recognition system degrades significantly when the incoming audio differs from training data. Maximum likelihood linear regression has been widely used for unsupervised adaptation, usually in a multiple-pass recognition process. Here we present a novel adaptation framework for which the offline, supervised, high-quality adaptation is applied to clustered channel/speaker conditions that are defined with automatic and manual clustering of the training data. Upon online recognition, each speech segment is classified into one of the training clusters in an unsupervised way, and the corresponding top acoustic models are used for recognition. Recognition lattice outputs are combined. Experiments are performed on the Wall Street Journal data, and a 37.5% relative reduction of Word Error Rate is reported. The proposed approach is also compared with a general speaker adaptive training approach.
  • Keywords
    acoustic signal processing; regression analysis; speech processing; Wall Street Journal data; automatic speech recognition system; maximum likelihood linear regression; multiple pass recognition process; offline adapted cluster acoustic models; online recognition; unsupervised segment classification; Acoustics; Adaptation models; Hidden Markov models; Speech; Speech recognition; Training; Training data; CMLLR; MLLR; ROVER; SAT; clustering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178852
  • Filename
    7178852