• DocumentCode
    323562
  • Title

    Language adaptation of multilingual phone models for vocabulary independent speech recognition tasks

  • Author

    Köhler, Joachim

  • Author_Institution
    Corp. Technol., Siemens AG, Munich, Germany
  • Volume
    1
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    417
  • Abstract
    This paper presents our new results on multilingual phone modeling and adaptation into a new target language which is not included in the trained multilingual models. The experiments were carried out with the SpeechDat(M) and MacroPhone databases including the languages of French, German, Italian, Portuguese, Spanish and American English. First, we constructed language-dependent and multilingual phone models. The recognition rate for an isolated word task decreased in average only by 3.2% using 95 multilingual instead of 232 language-dependent models. Second, we investigated adaptation techniques for cross-language transfer and showed that only 100 utterances from a new language were needed for adaptation. Using the MAP algorithm the recognition rate was improved from 79.9% to 84.3%. Finally, we defined a phonetic based dissimilarity measure between 2 languages and compared language-dependent and multilingual models for the purpose of cross-language transfer
  • Keywords
    maximum likelihood estimation; natural languages; speech recognition; American English; French; German; Italian; MAP algorithm; MacroPhone database; Portuguese; Spanish; SpeechDat database; cross-language transfer; experiments; isolated word task; language adaptation; language-dependent phone model; multilingual phone models; phonetic based dissimilarity measure; recognition rate; trained multilingual models; vocabulary independent speech recognition; Adaptation model; Automatic speech recognition; Databases; Natural languages; Robustness; Speech recognition; Telephony; Training data; Uniform resource locators; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.674456
  • Filename
    674456