DocumentCode
417176
Title
Language boundary detection and identification of mixed-language speech based on MAP estimation
Author
Shia, Chi-Jiun ; Chiu, Yu-Hsien ; Hsieh, Jia-Hsin ; Wu, Chung-Hsien
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The paper proposes a maximum a posteriori (MAP) based approach to segment and identify jointly an utterance with mixed languages. A statistical framework for language boundary detection and language identification is proposed. First, the MAP estimation is used to determine the boundary number and positions. Further, an LSA-based GMM and a VQ-based bigram language model are proposed to characterize a language and used for language identification. Finally, a likelihood ratio test approach is used to determine the optimal number of language boundaries. Experimental results show that the proposed approach exhibits encouraging potential in mixed-language segmentation and identification.
Keywords
Gaussian processes; least squares approximations; maximum likelihood estimation; natural languages; speech recognition; statistical analysis; vector quantisation; GMM; LSA; MAP estimation; VQ; bigram language model; language boundary detection; language identification; likelihood ratio test; maximum a posteriori estimation; mixed-language speech; Acoustic signal detection; Application software; Computer science; Humans; Matrix converters; Natural languages; Probability; Robustness; Speech; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326002
Filename
1326002
Link To Document