• DocumentCode
    507054
  • Title

    Investigations to Minimum Phone Error Training in Bilingual Speech Recognition

  • Author

    Xu, Ran ; Zhang, Qingqing ; Pan, Jielin ; Yan, Yonghong

  • Author_Institution
    ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
  • Volume
    4
  • fYear
    2009
  • fDate
    14-16 Aug. 2009
  • Firstpage
    486
  • Lastpage
    490
  • Abstract
    The great success of Minimum Phone Error (MPE) training criterion in mono-language large vocabulary continuous speech recognition (LVCSR) tasks motivates us to apply it to bilingual LVCSR systems. In this paper, in conjunction with the previous respectable bilingual phoneme inventory construction techniques, we give a comprehensive investigation to the performance of MPE/fMPE on various Mandarin-English bilingual test sets under different test conditions. The evaluation results show that the final fMPE+MPE model achieves significant improvements compared to the baseline models. On the mono-language test sets, the best improvement is a relative error rate reduction of 28.4%. And on the code-mixing test set, it also achieves a relative error rate reduction of 8.1%. The within- and cross-language substitution error rate introduced in this paper also explicitly shows that fMPE/MPE training can effectively improve the model´s within- and cross-language discriminabilities in our bilingual recognition tasks.
  • Keywords
    natural languages; speech recognition; Mandarin-English bilingual test sets; bilingual phoneme inventory construction; bilingual speech recognition; fMPE; minimum phone error training; mono-language large vocabulary continuous speech recognition; Acoustic testing; Error analysis; Fuzzy systems; Loudspeakers; Maximum likelihood estimation; Natural languages; Radio access networks; Speech recognition; System testing; Vocabulary; discriminative training; minimum phone error; multilingual recognition; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2009. FSKD '09. Sixth International Conference on
  • Conference_Location
    Tianjin
  • Print_ISBN
    978-0-7695-3735-1
  • Type

    conf

  • DOI
    10.1109/FSKD.2009.434
  • Filename
    5359221