• DocumentCode
    66923
  • Title

    Dynamic Combination of Automatic Speech Recognition Systems by Driven Decoding

  • Author

    Lecouteux, Benjamin ; Linares, Georges ; Esteve, Y. ; Gravier, Guillaume

  • Author_Institution
    GETALP Team, Univ. of Grenoble Alpes, Grenoble, France
  • Volume
    21
  • Issue
    6
  • fYear
    2013
  • fDate
    Jun-13
  • Firstpage
    1251
  • Lastpage
    1260
  • Abstract
    Combining automatic speech recognition (ASR) systems generally relies on the posterior merging of the outputs or on acoustic cross-adaptation. In this paper, we propose an integrated approach where outputs of secondary systems are integrated in the search algorithm of a primary one. In this driven decoding algorithm (DDA), the secondary systems are viewed as observation sources that should be evaluated and combined to others by a primary search algorithm. DDA is evaluated on a subset of the ESTER I corpus consisting of 4 hours of French radio broadcast news. Results demonstrate DDA significantly outperforms vote-based approaches: we obtain an improvement of 14.5% relative word error rate over the best single-systems, as opposed to the the 6.7% with a ROVER combination. An in-depth analysis of the DDA shows its ability to improve robustness (gains are greater in adverse conditions) and a relatively low dependency on the search algorithm. The application of DDA to both and beam-search-based decoder yields similar performances.
  • Keywords
    search problems; speech coding; speech recognition; ASR system; DDA; ESTER I corpus; ROVER combination; automatic speech recognition system; beam-search-based decoder; driven decoding algorithm; dynamic combination; search algorithm; word error rate; Acoustics; Adaptation models; Decoding; Hidden Markov models; Pragmatics; Speech; Speech recognition; Automatic speech recognition; speech processing; system combination;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2248716
  • Filename
    6469173