DocumentCode :
417284
Title :
Chinese-English bilingual phone modeling for cross-language speech recognition
Author :
Yu, Shengmin ; Zhang, Shuwu ; Xu, Bo
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
Volume :
1
fYear :
2004
fDate :
17-21 May 2004
Abstract :
In this paper, three different approaches to Chinese-English bilingual phone modeling are investigated and compared. The first approach is to simply combine Chinese and English phone inventories together without phone sharing across the languages. The second one is to map language-dependent phones to the inventory of the International Phonetic Association (IPA) based on phonetic knowledge to construct the bilingual phone inventory. The third one is to merge the language-dependent phone models by an hierarchical phone clustering algorithm to get a compact bilingual inventory. In the third approach, two distance measures are used to perform the bottom-up clustering. One is the Bhattacharyya distance. The other is the acoustic likelihood distance. Experimental results show that the phone clustering approach outperforms the IPA-based phone mapping approach, and it can also achieve comparable performance to the simple combination of language-dependent phone inventories with fewer model parameters, especially when using acoustic likelihood distance measurement.
Keywords :
maximum likelihood estimation; pattern clustering; speech processing; speech recognition; Bhattacharyya distance; Chinese-English bilingual phone modeling; IPA; International Phonetic Association; acoustic likelihood distance; bilingual phone inventory; bottom-up clustering; compact bilingual inventory; cross-language speech recognition; distance measures; hierarchical phone clustering; language-dependent phones; performance; phonetic knowledge; Acoustic measurements; Automatic speech recognition; Automation; Clustering algorithms; Distance measurement; Loudspeakers; Natural languages; Performance evaluation; Speech recognition; Technological innovation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1326136
Filename :
1326136
Link To Document :
بازگشت