مرکز منطقه ای اطلاع رساني علوم و فناوري - Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data

DocumentCode :

294525

Title :

Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data

Author :

Wang, Hsin Min ; Shen, Jia Lin ; Yang, Yen Ju ; Tseng, Chiu Yu ; Lee, Lin Shan

Author_Institution :

Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan

Volume :

fYear :

1995

fDate :

9-12 May 1995

Firstpage :

Abstract :

This paper presents the first known results for complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but very limited training data. Although some isolated-syllable-based or isolated-word-based large-vocabulary Mandarin speech recognition systems have been successfully developed, a continuous-speech-based system of this kind has never been reported before. For successful development of this system, several important techniques have been used, including acoustic modeling of a set of sub-syllabic models for base syllable recognition and another set of context-dependent models for tone recognition, a multiple candidate searching technique based on a concatenated syllable matching algorithm to synchronize base syllable and tone recognition, and a word-class-based Chinese language model for linguistic decoding. The best recognition accuracy achieved is 88.69% for finally decoded Chinese characters, with 88.69%, 91.57%, and 81.37% accuracy for base syllables, tones, and tonal syllables respectively

Keywords :

context-sensitive languages; decoding; hidden Markov models; natural languages; speech recognition; Chinese language; HMM; acoustic modeling; base syllable recognition; complete recognition; concatenated syllable matching algorithm; context-dependent models; continuous Mandarin speech; limited training data; linguistic decoding; multiple candidate searching technique; recognition accuracy; sub-syllabic models; tone recognition; very large vocabulary; word-class-based Chinese language model; Character recognition; Computer science; Concatenated codes; Context modeling; Data engineering; Decoding; Natural languages; Speech recognition; Training data; Vocabulary;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location :

Detroit, MI

ISSN :

1520-6149

Print_ISBN :

0-7803-2431-5

Type :

conf

DOI :

10.1109/ICASSP.1995.479273

Filename :

479273

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=294525