• DocumentCode
    3162834
  • Title

    Multilingual MLP features for low-resource LVCSR systems

  • Author

    Thomas, Samuel ; Ganapathy, Sriram ; Hermansky, Hynek

  • Author_Institution
    Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    4269
  • Lastpage
    4272
  • Abstract
    We introduce a new approach to training multilayer perceptrons (MLPs) for large vocabulary continuous speech recognition (LVCSR) in new languages which have only few hours of annotated in-domain training data (for example, 1 hour of data). In our approach, large amounts of annotated out-of-domain data from multiple languages are used to train multilingual MLP systems without dealing with the different phoneme sets for these languages. Features extracted from these MLP systems are used to train LVCSR systems in the low-resource language similar to the Tandem approach. In our experiments, the proposed features provide a relative improvement of about 30% in an low-resource LVCSR setting with only one hour of training data.
  • Keywords
    feature extraction; multilayer perceptrons; speech recognition; annotated in-domain training data; annotated out-of-domain data; feature extraction; large vocabulary continuous speech recognition; low-resource LVCSR systems; low-resource language; multilayer perceptrons; multilingual MLP features; multilingual MLP systems; multiple languages; phoneme sets; tandem approach; Abstracts; Hidden Markov models; Nonhomogeneous media; Speech; Switches; Vocabulary; MLP features for low-resource LVCSR; Multilingual training; multilayer perceptrons;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288862
  • Filename
    6288862