• DocumentCode
    10204
  • Title

    A Real-Time End-to-End Multilingual Speech Recognition Architecture

  • Author

    Gonzalez-Dominguez, Javier ; Eustis, David ; Lopez-Moreno, Ignacio ; Senior, Andrew ; Beaufays, Francoise ; Moreno, Pedro J.

  • Author_Institution
    Google Inc., New York, NY, USA
  • Volume
    9
  • Issue
    4
  • fYear
    2015
  • fDate
    Jun-15
  • Firstpage
    749
  • Lastpage
    759
  • Abstract
    Automatic speech recognition (ASR) systems are used daily by millions of people worldwide to dictate messages, control devices, initiate searches or to facilitate data input in small devices. The user experience in these scenarios depends on the quality of the speech transcriptions and on the responsiveness of the system. For multilingual users, a further obstacle to natural interaction is the monolingual character of many ASR systems, in which users are constrained to a single preset language. In this work, we present an end-to-end multi-language ASR architecture, developed and deployed at Google, that allows users to select arbitrary combinations of spoken languages. We leverage recent advances in language identification and a novel method of real-time language selection to achieve similar recognition accuracy and nearly-identical latency characteristics as a monolingual system.
  • Keywords
    speech recognition; Google; automatic speech recognition systems; end-to-end multilanguage ASR architecture; language identification; monolingual character; monolingual system; nearly-identical latency characteristics; real-time end-to-end multilingual speech recognition architecture; recognition accuracy; single preset language; speech transcriptions; spoken languages; Computer architecture; Google; Pipelines; Real-time systems; Signal processing; Speech; Speech recognition; Automatic speech recognition (ASR); deep neural network (DNN); language identification (LID); multilingual;
  • fLanguage
    English
  • Journal_Title
    Selected Topics in Signal Processing, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    1932-4553
  • Type

    jour

  • DOI
    10.1109/JSTSP.2014.2364559
  • Filename
    6935076