• DocumentCode
    179901
  • Title

    Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task

  • Author

    Stuker, Sebastian ; Muller, Mathias ; Quoc Bao Nguyen ; Waibel, Alex

  • Author_Institution
    Inst. for Anthropomatics, Karlsruhe Inst. of Technol., Karlsruhe, Germany
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    6374
  • Lastpage
    6378
  • Abstract
    In the IARPA sponsored program BABEL we are faced with the challenge of training automatic speech recognition systems in sparse data conditions in very little time. In this paper we show that by using multilingual bootstrapping techniques in combination with multilingual deep belief bottle neck features that are only fine tuned on the target language the training time of an LVCSR system can be essentially halved while the word error rate stays the same. We show this for recognition systems on Tagalog, making use of multilingual systems trained on the other four languages of the Babel base period: Cantonese, Pashto, Turkish, and Vietnamese.
  • Keywords
    computational linguistics; error statistics; speech recognition; statistical analysis; training; vocabulary; BABEL ASR; Babel base period; Cantonese; IARPA; LVCSR system; Pashto; Tagalog; Turkish; Vietnamese; automatic speech recognition; multilingual bootstrapping technique; multilingual deep belief bottle neck feature; multilingual systems; performance improvement; sparse data conditions; training time reduction; word error rate; Acoustics; Context modeling; Data models; Hidden Markov models; Speech recognition; Training; Training data; automatic speech recognition; multilingual speech recognition; rapid system development;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854831
  • Filename
    6854831