DocumentCode
353508
Title
Acoustic modeling for Chinese speech recognition: a comparative study of Mandarin and Cantonese
Author
Gao, Sheng ; Lee, Tan ; Wong, I.K. ; Bo Xu ; Ching, P.C. ; Huang, Taiyi
Author_Institution
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin, China
Volume
3
fYear
2000
fDate
2000
Firstpage
1261
Abstract
This paper presents a comparative study on automatic speech recognition for two different Chinese dialects, namely Mandarin and Cantonese. It focuses on decision-tree based context-dependent acoustic modeling for large-vocabulary continuous speech recognition. Extensive phonological and phonetic knowledge are incorporated to design questions concerning the left and right context of sub-syllable units, namely INITIALs and FINALs. This results in a set of class-triphone models for each dialect. Syllable recognition accuracy of 81.7% and 75.5% are attained for Mandarin and Cantonese respectively. Such a performance gap is accountable by various linguistic and practical reasons, including: 1) phonological and phonetic discrepancies between the two dialects; 2) design of training databases; and 3) design of phonetic questions in decision-tree clustering
Keywords
decision trees; natural languages; speech recognition; Cantonese; Chinese dialects; Chinese speech recognition; FINALs; INITIALs; Mandarin; acoustic modeling; class-triphone models; decision-tree based context-dependent acoustic modeling; decision-tree clustering; large-vocabulary continuous speech recognition; performance; phonetic discrepancies; phonetic knowledge; phonological discrepancies; phonological knowledge; sub-syllable units; syllable recognition accuracy; training databases; Acoustic testing; Acoustical engineering; Automatic testing; Context modeling; Data engineering; Databases; Electronic equipment testing; Laboratories; Pattern recognition; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location
Istanbul
ISSN
1520-6149
Print_ISBN
0-7803-6293-4
Type
conf
DOI
10.1109/ICASSP.2000.861805
Filename
861805
Link To Document