DocumentCode
3440337
Title
Estimating the speaking rate by vowel detection
Author
Pfau, T. ; Ruske, G.
Author_Institution
Inst. for Human-Machine-Commun., Tech. Univ. Munchen, Germany
Volume
2
fYear
1998
fDate
12-15 May 1998
Firstpage
945
Abstract
We present a new feature-based method for estimating the speaking rate by detecting vowels in continuous speech. The features used are the modified loudness and the zerocrossing rate which are both calculated in the standard preprocessing unit of our speech recognition system. As vowels in general correspond to syllable nuclei, the feature-based vowel rate is comparable to an estimate of the lexically-based syllable rate. The vowel detector presented is tested on the spontaneously spoken German Verbmobil task and is evaluated using manually transcribed data. The lowest vowel error rate (including insertions) on the defined test set is 22.72% on average over all vowels. Additionally correlation coefficients between our estimates and reference rates are calculated. These coefficients reach up to 0.796 and therefore are comparable to those for lexically-based measures (like the phone rate) on other tasks. The accuracy is sufficient to use our measurement for speaking rate adaptation
Keywords
acoustic correlation; parameter estimation; speech processing; speech recognition; accuracy; acoustic signal; continuous speech; correlation coefficients; feature-based method; feature-based vowel rate; insertions; lexically-based syllable rate; manually transcribed data; measurement; modified loudness; phone rate; preprocessing unit; reference rates; speaking rate adaptation; speaking rate estimation; speech recognition system; spontaneously spoken German Verbmobil task; syllable nuclei; vowel detection; vowel detector; vowel error rate; zerocrossing rate; Automatic speech recognition; Databases; Degradation; Detectors; Error analysis; Maximum likelihood linear regression; Robustness; Speech processing; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location
Seattle, WA
ISSN
1520-6149
Print_ISBN
0-7803-4428-6
Type
conf
DOI
10.1109/ICASSP.1998.675422
Filename
675422
Link To Document