Title :
Use of prosodic information for Mandarin word post-recognition
Author :
Wu, Chung-Hsien ; Chen, Yeou-Jiunn
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Cheng Kung Univ., Tainan, Taiwan
Abstract :
A two-stage recognition scheme, phonetic recognition followed by prosodic recognition is established. In the phonetic recognition process, 21 initial and 37 final context-independent HMMs are used to construct the phonetic recognizer. In the prosodic recognizer, 175 context-dependent prosodic HMMs are used to model the complicated tone behavior for all possible tone concatenations. Five anti-prosodic HMMs, each corresponding to one lexical tone, are constructed to enhance the discrimination among prosodic HMMs. This system was evaluated in a speaker-dependent mode on a vocabulary size of thirty thousand words. The experimental results show that the recognition rate was improved from 80.3% to 86.7% using the prosodic information.
Keywords :
feature extraction; hidden Markov models; natural languages; speech recognition; Mandarin word post-recognition; context-dependent prosodic HMM; experimental results; feature extraction; phonetic recognition; prosodic information; prosodic recognition; recognition rate; speaker-dependent system; tone concatenations; two-stage recognition; vocabulary size; Computer science; Context modeling; Data mining; Databases; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech recognition; Viterbi algorithm; Vocabulary;
Conference_Titel :
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location :
Brisbane, Qld., Australia
Print_ISBN :
0-7803-4365-4
DOI :
10.1109/TENCON.1997.647305