• DocumentCode
    417176
  • Title

    Language boundary detection and identification of mixed-language speech based on MAP estimation

  • Author

    Shia, Chi-Jiun ; Chiu, Yu-Hsien ; Hsieh, Jia-Hsin ; Wu, Chung-Hsien

  • Author_Institution
    Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    The paper proposes a maximum a posteriori (MAP) based approach to segment and identify jointly an utterance with mixed languages. A statistical framework for language boundary detection and language identification is proposed. First, the MAP estimation is used to determine the boundary number and positions. Further, an LSA-based GMM and a VQ-based bigram language model are proposed to characterize a language and used for language identification. Finally, a likelihood ratio test approach is used to determine the optimal number of language boundaries. Experimental results show that the proposed approach exhibits encouraging potential in mixed-language segmentation and identification.
  • Keywords
    Gaussian processes; least squares approximations; maximum likelihood estimation; natural languages; speech recognition; statistical analysis; vector quantisation; GMM; LSA; MAP estimation; VQ; bigram language model; language boundary detection; language identification; likelihood ratio test; maximum a posteriori estimation; mixed-language speech; Acoustic signal detection; Application software; Computer science; Humans; Matrix converters; Natural languages; Probability; Robustness; Speech; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326002
  • Filename
    1326002