Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data

Author

Wang, Hsin Min ; Shen, Jia Lin ; Yang, Yen Ju ; Tseng, Chiu Yu ; Lee, Lin Shan

Author_Institution

Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan

Volume

1

fYear

1995

fDate

9-12 May 1995

Firstpage

61

Abstract

This paper presents the first known results for complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but very limited training data. Although some isolated-syllable-based or isolated-word-based large-vocabulary Mandarin speech recognition systems have been successfully developed, a continuous-speech-based system of this kind has never been reported before. For successful development of this system, several important techniques have been used, including acoustic modeling of a set of sub-syllabic models for base syllable recognition and another set of context-dependent models for tone recognition, a multiple candidate searching technique based on a concatenated syllable matching algorithm to synchronize base syllable and tone recognition, and a word-class-based Chinese language model for linguistic decoding. The best recognition accuracy achieved is 88.69% for finally decoded Chinese characters, with 88.69%, 91.57%, and 81.37% accuracy for base syllables, tones, and tonal syllables respectively

Keywords

context-sensitive languages; decoding; hidden Markov models; natural languages; speech recognition; Chinese language; HMM; acoustic modeling; base syllable recognition; complete recognition; concatenated syllable matching algorithm; context-dependent models; continuous Mandarin speech; limited training data; linguistic decoding; multiple candidate searching technique; recognition accuracy; sub-syllabic models; tone recognition; very large vocabulary; word-class-based Chinese language model; Character recognition; Computer science; Concatenated codes; Context modeling; Data engineering; Decoding; Natural languages; Speech recognition; Training data; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location

Detroit, MI

ISSN

1520-6149

Print_ISBN

0-7803-2431-5

Type

conf

DOI

10.1109/ICASSP.1995.479273

Filename

479273