DocumentCode :
908598
Title :
On jointly learning the parameters in a character-synchronous integrated speech and language model
Author :
Chiang, Tung-Hui ; Lin, Yi-Chung ; Su, Keh-Yih
Author_Institution :
Comput. & Commun. Res. Labs., Ind. Technol. Res. Inst., Hsinchu, Taiwan
Volume :
4
Issue :
3
fYear :
1996
fDate :
5/1/1996 12:00:00 AM
Firstpage :
167
Lastpage :
189
Abstract :
A joint learning algorithm is proposed in this paper to enhance the integrated speech and language model operating in the character-synchronous mode. A character-synchronous approach is first proposed to integrate speech and language information, including morphology and parts-of-speech, for ranking the candidates right after each Chinese character is uttered. Since the search space is cut down very efficiently by applying high-level knowledge in early time, the character-synchronous score function enables our system to operate in real time. To further enhance system performance, a joint learning algorithm is then derived to adjust all parameters of the speech and language processing modules simultaneously, according to their contributions to the overall discrimination power, to minimize the error rate. The proposed approaches are compared with a baseline system, which directly couples an HMM-based speech recognizer with a bigram language model. The performance of 75.71% character accuracy rate is obtained for the baseline system when it is tested on the task of recognizing 1000 Chinese spoken sentences with a very large vocabulary (90495 words) in the speaker-independent isolated-character mode. The character accuracy rate is improved to 88.26% with the character-synchronous approach, and 94.16% after the joint learning algorithm is applied
Keywords :
error statistics; hidden Markov models; natural languages; parameter estimation; speech recognition; Chinese character; Chinese spoken sentences; character accuracy rate; character-synchronous approach; character-synchronous integrated speech-language model; character-synchronous score function; discrimination power; error rate; high-level knowledge; joint learning algorithm; language processing modules; morphology; parts-of-speech; search space; speaker-independent isolated-character mode; system performance; Error analysis; Hidden Markov models; Morphology; Natural languages; Power system modeling; Real time systems; Speech enhancement; Speech processing; Speech recognition; System performance;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.496214
Filename :
496214
Link To Document :
بازگشت