• DocumentCode
    417210
  • Title

    Entropy-based variable frame rate analysis of speech signals and its application to ASR

  • Author

    You, H. ; Zhu, Q. ; Alwan, A.

  • Author_Institution
    Electr. Eng. Dept., UCLA, Los Angeles, CA, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Most speech processing algorithms analyze speech signals frame by frame with a fixed frame rate. Fixed-rate analysis is inconsistent with human speech perception and effectively assigns the same importance or ´weight´ to all equi-duration frames. In Zhu et al. (2000), we proposed a variable frame rate (VFR) analysis technique that is based on a Euclidian distance measure. In this paper, we propose another approach for VFR based on the entropy of the signal. We compare entropy and Euclidian distance measures for VFR in ASR experiments using the Aurora2 and T146 databases. Better performance is observed for the entropy-based VFR over our earlier VFR approach and over the fixed-rate system.
  • Keywords
    entropy; speech processing; speech recognition; ASR; Aurora2; Euclidian distance measures; T146 database; VFR; automatic speech recognition; entropy; fixed-rate analysis; performance; speech processing algorithms; speech signals; variable frame rate analysis; Acoustic noise; Automatic speech recognition; Covariance matrix; Distributed computing; Entropy; Random variables; Signal analysis; Signal processing; Speech analysis; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326044
  • Filename
    1326044