DocumentCode :
294525
Title :
Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data
Author :
Wang, Hsin Min ; Shen, Jia Lin ; Yang, Yen Ju ; Tseng, Chiu Yu ; Lee, Lin Shan
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
61
Abstract :
This paper presents the first known results for complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but very limited training data. Although some isolated-syllable-based or isolated-word-based large-vocabulary Mandarin speech recognition systems have been successfully developed, a continuous-speech-based system of this kind has never been reported before. For successful development of this system, several important techniques have been used, including acoustic modeling of a set of sub-syllabic models for base syllable recognition and another set of context-dependent models for tone recognition, a multiple candidate searching technique based on a concatenated syllable matching algorithm to synchronize base syllable and tone recognition, and a word-class-based Chinese language model for linguistic decoding. The best recognition accuracy achieved is 88.69% for finally decoded Chinese characters, with 88.69%, 91.57%, and 81.37% accuracy for base syllables, tones, and tonal syllables, respectively.
Keywords :
context-sensitive languages; decoding; hidden Markov models; natural languages; speech recognition; Chinese language; HMM; acoustic modeling; base syllable recognition; complete recognition; concatenated syllable matching algorithm; context-dependent models; continuous Mandarin speech; limited training data; linguistic decoding; multiple candidate searching technique; recognition accuracy; sub-syllabic models; tone recognition; very large vocabulary; word-class-based Chinese language model; Character recognition; Computer science; Concatenated codes; Context modeling; Data engineering; Decoding; Natural languages; Speech recognition; Training data; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479273
Filename :
479273