• DocumentCode
    3440337
  • Title

    Estimating the speaking rate by vowel detection

  • Author

    Pfau, T. ; Ruske, G.

  • Author_Institution
    Inst. for Human-Machine-Commun., Tech. Univ. Munchen, Germany
  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    945
  • Abstract
    We present a new feature-based method for estimating the speaking rate by detecting vowels in continuous speech. The features used are the modified loudness and the zerocrossing rate which are both calculated in the standard preprocessing unit of our speech recognition system. As vowels in general correspond to syllable nuclei, the feature-based vowel rate is comparable to an estimate of the lexically-based syllable rate. The vowel detector presented is tested on the spontaneously spoken German Verbmobil task and is evaluated using manually transcribed data. The lowest vowel error rate (including insertions) on the defined test set is 22.72% on average over all vowels. Additionally correlation coefficients between our estimates and reference rates are calculated. These coefficients reach up to 0.796 and therefore are comparable to those for lexically-based measures (like the phone rate) on other tasks. The accuracy is sufficient to use our measurement for speaking rate adaptation
  • Keywords
    acoustic correlation; parameter estimation; speech processing; speech recognition; accuracy; acoustic signal; continuous speech; correlation coefficients; feature-based method; feature-based vowel rate; insertions; lexically-based syllable rate; manually transcribed data; measurement; modified loudness; phone rate; preprocessing unit; reference rates; speaking rate adaptation; speaking rate estimation; speech recognition system; spontaneously spoken German Verbmobil task; syllable nuclei; vowel detection; vowel detector; vowel error rate; zerocrossing rate; Automatic speech recognition; Databases; Degradation; Detectors; Error analysis; Maximum likelihood linear regression; Robustness; Speech processing; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675422
  • Filename
    675422