DocumentCode
1749646
Title
Continuous speech recognition without end-point detection
Author
Segawa, Osamu ; Takeda, Kazuya ; Itakura, Fumitada
Author_Institution
Graduate Sch. of Eng., Nagoya Univ., Japan
Volume
1
fYear
2001
fDate
2001
Firstpage
245
Abstract
A continuous speech recognition method that does not need explicit speech end-point detection is proposed. A one-pass decoding algorithm is modified to decode input speech of infinite length so that, with appropriate nonspeech models for silence and ambient noises, continuous speech recognition can be executed without explicit endpoint detection. The basic algorithm: 1) decodes a processing block of predetermined length, 2) traces back and finds the boundaries of the processing blocks where the word history in the preceding processing block is merged into one, and 3) restarts decoding from the boundary frame with the merged word history. The effectiveness of the method is verified by the two dictating experiments. With 100 consecutive sentences of utterances from a newspaper, the degradation of the recognition accuracy due to the modification of the decoder is about 5% compared with the results when the correct end-point is given. With a 30 minutes dialogue in a moving car, 75% correct and 69% accuracy score is obtained
Keywords
decoding; speech recognition; ambient noises; continuous speech recognition; dialogue; dictating experiments; merged word history; newspaper; one-pass decoding algorithm; recognition accuracy; silence; Decoding; Degradation; Energy measurement; History; Monitoring; Power engineering and energy; Robustness; Speech enhancement; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location
Salt Lake City, UT
ISSN
1520-6149
Print_ISBN
0-7803-7041-4
Type
conf
DOI
10.1109/ICASSP.2001.940813
Filename
940813
Link To Document