Regional Information Center for Science and Technology - Golden Mandarin (I)-A real-time Mandarin speech dictation machine for Chinese language with very large vocabulary

DocumentCode :

921745

Title :

Golden Mandarin (I)-A real-time Mandarin speech dictation machine for Chinese language with very large vocabulary

Author :

Lee, Lin-shan ; Tseng, Chiu-Yu ; Gu, Hung-Yan ; Liu, Fu-hua ; Chang, Chen-hao ; Lin, Yueh-hong ; Lee, Yumin ; Tu, Shih-Lung ; Hsieh, Shew-Heng ; Chen, Chian-hung

Author_Institution :

Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan

Volume :

Issue :

fYear :

1993

fDate :

4/1/1993

Firstpage :

158

Lastpage :

179

Abstract :

The first successfully implemented real-time Mandarin dictation machine, which recognizes Mandarin speech with a very large vocabulary and almost unlimited texts for the input of Chinese characters into computers, is described. The machine is speaker-dependent, and the input speech is in the form of sequences of isolated syllables. The machine can be decomposed into two subsystems. The first subsystem recognizes the syllables using hidden Markov models. Because every syllable can represent many different homonym characters and can form different multisyllabic words with the syllables on its right or left, a second subsystem is needed to identify the exact characters from the syllables and to correct errors in syllable recognition. The real-time implementation is on an IBM PC/AT connected to three sets of specially designed hardware boards, on which seven TMS 320C25 chips operate in parallel. Preliminary test results indicate that it takes only about 0.45 s to dictate a syllable (or character), with an accuracy on the order of 90%.
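
Illustrative sketch (not from the paper): the abstract says the second subsystem picks the exact characters for a sequence of recognized syllables, but it does not say how. One common way to resolve homonyms is a Viterbi search over the candidate characters, scored by a character-bigram table. Everything below is an assumption made for illustration: the function name `decode_characters`, the toy tables `HOMONYMS` and `BIGRAM_LOGP`, and the scores themselves are hypothetical and are not taken from Golden Mandarin (I).

```python
# Hypothetical toy data: each syllable (pinyin with tone number) maps to
# several homonym characters. Real tables would be far larger.
HOMONYMS = {
    "dian4": ["店", "電", "殿"],
    "nao3": ["惱", "腦"],
    "hua4": ["畫", "話", "化"],
}

# Hypothetical log-scores for adjacent character pairs; unseen pairs get a
# fixed penalty inside decode_characters.
BIGRAM_LOGP = {
    ("電", "腦"): -1.0,  # "computer"
    ("電", "話"): -1.0,  # "telephone"
}


def decode_characters(syllables, homonyms, bigram_logp, unseen_logp=-10.0):
    """Choose one character per syllable by maximizing summed bigram log-scores."""
    if not syllables:
        return []
    # best[c] = (score of the best path ending in character c, that path)
    best = {c: (0.0, [c]) for c in homonyms[syllables[0]]}
    for syl in syllables[1:]:
        nxt = {}
        for c in homonyms[syl]:
            # Extend every surviving path with c and keep the highest-scoring one.
            score, path = max(
                (
                    (prev_score + bigram_logp.get((prev, c), unseen_logp), prev_path)
                    for prev, (prev_score, prev_path) in best.items()
                ),
                key=lambda item: item[0],
            )
            nxt[c] = (score, path + [c])
        best = nxt
    return max(best.values(), key=lambda item: item[0])[1]


print("".join(decode_characters(["dian4", "nao3"], HOMONYMS, BIGRAM_LOGP)))  # 電腦
print("".join(decode_characters(["dian4", "hua4"], HOMONYMS, BIGRAM_LOGP)))  # 電話
```

The same syllable "dian4" resolves to the same character in both calls only because its neighbor supports it, which is the context dependence the abstract describes. This sketch assumes the syllables are already correct; correcting syllable recognition errors, which the abstract also assigns to the second subsystem, would require searching over alternative syllable candidates as well.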

Keywords :

dictation; hidden Markov models; real-time systems; speech recognition equipment; Chinese language; Golden Mandarin (I); HMM; IBM PC/AT; Mandarin speech dictation machine; TMS 320C25 chips; homonym characters; multisyllabic words; real-time implementation; sequences of isolated syllables; speaker-dependent; speech recognition; syllable recognition; very large vocabulary; voice input; Character recognition; Computer science; Error correction; Hardware; Lattices; Natural languages; Text recognition; Vocabulary

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

IEEE

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.222876

Filename :

222876

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=921745