• DocumentCode
    661362
  • Title

    A particle filter compensation approach to robust LVCSR

  • Author

    Duc Hoang Ha Nguyen ; Mushtaq, Aleem ; Xiong Xiao ; Eng Siong Chng ; Haizhou Li ; Chin-Hui Lee

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2013
  • fDate
    Oct. 29 2013-Nov. 1 2013
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    We extend our previous work on particle filter compensation (PFC) to large vocabulary continuous speech recognition (LVCSR) and conduct the experiments on Aurora-4 database. Obtaining an accurately aligned state and mixture sequence of hidden Markov models (HMMs) that describe the underlying clean speech features being estimated in noise is a challenging task for sub-word based LVCSR because the total number of triphone models involved can be very large. In this paper, we show that by using separate sets of HMMs for recognition and compensation, we can simplify the models used for PFC to a great extent and thus facilitate the estimation of the side information offered in the state and mixture sequences. When the missing side information for PFC is available, a large word error reduction of 28.46% from multi-condition training is observed. In the actual scenarios, an error reduction of only 5.3% is obtained. We are anticipating improved results that will narrow the gap between the system today and what´s achievable if the side information could be exactly specified.
  • Keywords
    compensation; hidden Markov models; particle filtering (numerical methods); speech recognition; Aurora-4 database; HMM; PFC; hidden Markov models; large vocabulary continuous speech recognition; mixture sequences; multicondition training; particle filter compensation approach; side information estimation; speech feature compensation; state sequences; sub-word based LVCSR; triphone models; word error reduction; Computational modeling; Hidden Markov models; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; clustering; particle filter; robustness; speech feature compensation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
  • Conference_Location
    Kaohsiung
  • Type

    conf

  • DOI
    10.1109/APSIPA.2013.6694223
  • Filename
    6694223