Title :
A new method for segmenting continuous speech
Author :
Pawate, B.I. ; Dowling, Eric
Author_Institution :
Tsukuba Res. & Dev. Center, Texas Instrum., Ibaraki, Japan
Abstract :
Speech recognition systems are increasingly utilized in various applications like telephone services where a user places a call by uttering the digits or the name of the person. One of the main problems in this application is the segmentation of the input utterance into speech and nonspeech portions. Current approaches typically suffer from two problems. They either incorporate noise as a part of the word to be enrolled or falsely classify a portion of a word as noise. As a result, recognition performance suffers. The authors present another approach to automatically segment continuous speech and create speaker dependent models. To verify the hypothesis, they use a database of 30 speakers whose speech has been recorded over the public switched telephone network. With this database, they benchmark their algorithm against a state of the art approach and show a 4× reduction in the error rate of the recognition system
Keywords :
speech processing; speech recognition; telephony; algorithm; error rate; input utterance; nonspeech portion; public switched telephone network; recognition performance; segmentation; speaker dependent models; speech portion; speech recognition systems; telephone services; Databases; Detectors; Error analysis; Instruments; Noise level; Speech enhancement; Speech recognition; Target recognition; Telephony; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
Print_ISBN :
0-7803-1775-0
DOI :
10.1109/ICASSP.1994.389357