DocumentCode
311031
Title
An efficient search method for large-vocabulary continuous-speech recognition
Author
Hanazawa, Ken ; Minami, Yasuhiro ; Furui, Sadaoki
Author_Institution
Tokyo Inst. of Technol., Japan
Volume
3
fYear
1997
fDate
21-24 Apr 1997
Firstpage
1787
Abstract
This paper proposes an efficient method for large-vocabulary continuous-speech recognition, using a compact data structure and an efficient search algorithm. We introduce a very compact data structure DAWG as a lexicon to reduce the search space. We also propose a search algorithm to obtain the N-best hypotheses using the DAWG structure. This search algorithm is composed of two phases: “forward search” and “traceback”. Forward search, which basically uses the time-synchronous Viterbi algorithm, merges candidates and stores the information about them in DAWG structures to create phoneme graphs. Traceback traces the phoneme graphs to obtain the N-best hypotheses. An evaluation of this method´s performance using a speech-recognition-based telephone-directory-assistance system having a 4000-word vocabulary confirmed that our strategy improves speech recognition in terms of time and recognition rate
Keywords
data structures; graph theory; search problems; speech recognition; telephony; DAWG structures; N-best hypotheses; compact data structure; efficient search method; forward search; large-vocabulary continuous-speech recognition; lexicon; merging; phoneme graphs; speech-recognition-based telephone-directory-assistance system; storage; time-synchronous Viterbi algorithm; traceback; Cities and towns; Data structures; Humans; Information retrieval; Search methods; Space technology; Speech recognition; Telephony; Tree data structures; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.598877
Filename
598877
Link To Document