DocumentCode :
323782
Title :
Improved search strategy for large vocabulary continuous Mandarin speech recognition
Author :
Ho, Tai-Hsuan ; Yang, Kae-Cherng ; Huang, Kuo-Hsun ; Lee, Lin-shan
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei, Taiwan
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
825
Abstract :
This paper presents a new search strategy for large vocabulary continuous Mandarin speech recognition that takes the special structure of the Chinese language into account. The strategy is composed of a forward pass and a backward pass, between which a high-quality syllable lattice is generated to bridge the syllable-level and word-level decoding processes. In the forward pass, considering the small number of syllables in the Chinese language, a frame-synchronous stack decoder is used to integrate the high-order syllable N-gram language model, so as to generate a very accurate and compact syllable lattice. In the backward pass, considering the special monosyllabic wording structure of the Chinese language, the search space for word-level decoding is expanded dynamically from the syllable lattice, and the best word sequence is extracted based on the knowledge provided by the word pronunciation lexicon and the word N-gram language model. In preliminary experiments, it was found that, with this strategy, the character error rate can be reduced by more than 20% as compared with a previous system using a syllable-aligned lattice approach on a speaker-adaptive continuous speech recognition task.
Keywords :
decoding; search problems; speech recognition; Chinese language; Mandarin speech recognition; backward pass; character error rate; forward pass; frame-synchronous stack decoder; high-order syllable N-Gram language model; high-quality syllable lattice; large vocabulary continuous speech recognition; monosyllabic wording structure; search strategy; syllable-level decoding process; word pronunciation lexicon; word-level decoding process; Acoustic beams; Bridges; Computer science; Decoding; Error analysis; Information science; Lattices; Natural languages; Speech recognition; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675392
Filename :
675392