• DocumentCode
    388583
  • Title

    On the use of transient information in speech recognition

  • Author

    Lienard, Jean-Sylvain ; Soong, Frank K.

  • Author_Institution
    Bell Laboratories, Murray Hill, New Jersey
  • Volume
    9
  • fYear
    1984
  • fDate
    30742
  • Firstpage
    9
  • Lastpage
    12
  • Abstract
    In this paper we investigate the effects of signal processing on the performance of isolated-word recognition by changing various time-resolution related parameters. The vocabulary used, { , is a highly confusable subset of the 39-word alpha-digit database. We showed that the recognition performance is significantly improved by trace segmentation which compresses the steady-state parts of speech signals and refines the endpoints. By changing the cutoff frequency of the low-pass filter in the filterbank analysis, we observed the existence of an optimal region of cutoff frequencies ranging from 50 to 100 Hz (at -6 dB). Outside this region, the performance does not deteriorate completely even at a very low cutoff frequency where the transients are severely distorted. This phenomenon was explained by the fact of spectral modification of the steady-state vowels following the initial transients.
  • Keywords
    Band pass filters; Cutoff frequency; Databases; Filter bank; Linear predictive coding; Sampling methods; Shape control; Speech analysis; Speech recognition; Steady-state;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1984.1172563
  • Filename
    1172563