Continuous speech recognition without end-point detection

Author

Segawa, Osamu ; Takeda, Kazuya ; Itakura, Fumitada

Author_Institution

Graduate Sch. of Eng., Nagoya Univ., Japan

Volume

1

fYear

2001

fDate

2001

Firstpage

245

Abstract

A continuous speech recognition method that does not need explicit speech end-point detection is proposed. A one-pass decoding algorithm is modified to decode input speech of infinite length so that, with appropriate nonspeech models for silence and ambient noises, continuous speech recognition can be executed without explicit endpoint detection. The basic algorithm: 1) decodes a processing block of predetermined length, 2) traces back and finds the boundaries of the processing blocks where the word history in the preceding processing block is merged into one, and 3) restarts decoding from the boundary frame with the merged word history. The effectiveness of the method is verified by the two dictating experiments. With 100 consecutive sentences of utterances from a newspaper, the degradation of the recognition accuracy due to the modification of the decoder is about 5% compared with the results when the correct end-point is given. With a 30 minutes dialogue in a moving car, 75% correct and 69% accuracy score is obtained

Keywords

decoding; speech recognition; ambient noises; continuous speech recognition; dialogue; dictating experiments; merged word history; newspaper; one-pass decoding algorithm; recognition accuracy; silence; Decoding; Degradation; Energy measurement; History; Monitoring; Power engineering and energy; Robustness; Speech enhancement; Speech processing; Speech recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on

Conference_Location

Salt Lake City, UT

ISSN

1520-6149

Print_ISBN

0-7803-7041-4

Type

conf

DOI

10.1109/ICASSP.2001.940813

Filename

940813