Title :
Recognition of hesitations in spontaneous speech
Author :
O'Shaughnessy, Douglas
Author_Institution :
INRS-Telecommun., Quebec Univ., Verdun, Que., Canada
Abstract :
Both filled and unfilled (silent) hesitation pauses in a widely used speech database were examined, covering both unintended and intended pauses. A distinction is made between grammatical pauses (at major syntactic boundaries) and ungrammatical ones (within minor syntactic phrases). While unfilled pauses cannot be reliably separated into these two classes based on silence duration alone, grammatical pauses tended to be longer. In the prepausal word before ungrammatical pauses, there were few continuation rises in fundamental frequency (F0), whereas 70% of the grammatical pauses were accompanied by a prior F0 rise. Identifying the syntactic function of such pauses could improve the performance of an automatic speech recognizer by eliminating from consideration some hypotheses based on spectral analysis. Results are given which could allow simple identification of most filled and unfilled pauses and their syntactic function.
Keywords :
speech recognition; automatic speech recognizer; filled pauses; fundamental frequency; grammatical pauses; hesitation types; intended pauses; silence duration; spectral analysis; speech database; syntactic boundaries; syntactic phrases; unfilled pauses; ungrammatical pauses; unintended pauses; Application software; Automatic speech recognition; Business; Databases; Delay; Frequency; Loudspeakers; Spectral analysis; Speech analysis; Speech recognition
Conference_Title :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225857