DocumentCode
294525
Title
Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data
Author
Wang, Hsin Min ; Shen, Jia Lin ; Yang, Yen Ju ; Tseng, Chiu Yu ; Lee, Lin Shan
Author_Institution
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume
1
fYear
1995
fDate
9-12 May 1995
Firstpage
61
Abstract
This paper presents the first known results for complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but very limited training data. Although some isolated-syllable-based or isolated-word-based large-vocabulary Mandarin speech recognition systems have been successfully developed, a continuous-speech-based system of this kind has never been reported before. For successful development of this system, several important techniques have been used, including acoustic modeling of a set of sub-syllabic models for base syllable recognition and another set of context-dependent models for tone recognition, a multiple candidate searching technique based on a concatenated syllable matching algorithm to synchronize base syllable and tone recognition, and a word-class-based Chinese language model for linguistic decoding. The best recognition accuracy achieved is 88.69% for finally decoded Chinese characters, with 88.69%, 91.57%, and 81.37% accuracy for base syllables, tones, and tonal syllables respectively
Keywords
context-sensitive languages; decoding; hidden Markov models; natural languages; speech recognition; Chinese language; HMM; acoustic modeling; base syllable recognition; complete recognition; concatenated syllable matching algorithm; context-dependent models; continuous Mandarin speech; limited training data; linguistic decoding; multiple candidate searching technique; recognition accuracy; sub-syllabic models; tone recognition; very large vocabulary; word-class-based Chinese language model; Character recognition; Computer science; Concatenated codes; Context modeling; Data engineering; Decoding; Natural languages; Speech recognition; Training data; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location
Detroit, MI
ISSN
1520-6149
Print_ISBN
0-7803-2431-5
Type
conf
DOI
10.1109/ICASSP.1995.479273
Filename
479273
Link To Document