DocumentCode
290120
Title
A new method for segmenting continuous speech
Author
Pawate, B.I. ; Dowling, Eric
Author_Institution
Tsukuba Res. & Dev. Center, Texas Instrum., Ibaraki, Japan
Volume
i
fYear
1994
fDate
19-22 Apr 1994
Abstract
Speech recognition systems are increasingly used in applications such as telephone services, where a user places a call by uttering the digits or the name of the person. One of the main problems in this application is the segmentation of the input utterance into speech and nonspeech portions. Current approaches typically suffer from one of two problems: they either incorporate noise as part of the word to be enrolled or falsely classify a portion of a word as noise. As a result, recognition performance suffers. The authors present another approach to automatically segment continuous speech and create speaker-dependent models. To verify the hypothesis, they use a database of 30 speakers whose speech has been recorded over the public switched telephone network. With this database, they benchmark their algorithm against a state-of-the-art approach and show a 4× reduction in the error rate of the recognition system.
Keywords
speech processing; speech recognition; telephony; algorithm; error rate; input utterance; nonspeech portion; public switched telephone network; recognition performance; segmentation; speaker dependent models; speech portion; speech recognition systems; telephone services; Databases; Detectors; Error analysis; Instruments; Noise level; Speech enhancement; Speech recognition; Target recognition; Telephony; Vocabulary
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location
Adelaide, SA
ISSN
1520-6149
Print_ISBN
0-7803-1775-0
Type
conf
DOI
10.1109/ICASSP.1994.389357
Filename
389357