DocumentCode
417210
Title
Entropy-based variable frame rate analysis of speech signals and its application to ASR
Author
You, H. ; Zhu, Q. ; Alwan, A.
Author_Institution
Electr. Eng. Dept., UCLA, Los Angeles, CA, USA
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
Most speech processing algorithms analyze speech signals frame by frame with a fixed frame rate. Fixed-rate analysis is inconsistent with human speech perception and effectively assigns the same importance or ´weight´ to all equi-duration frames. In Zhu et al. (2000), we proposed a variable frame rate (VFR) analysis technique that is based on a Euclidian distance measure. In this paper, we propose another approach for VFR based on the entropy of the signal. We compare entropy and Euclidian distance measures for VFR in ASR experiments using the Aurora2 and T146 databases. Better performance is observed for the entropy-based VFR over our earlier VFR approach and over the fixed-rate system.
Keywords
entropy; speech processing; speech recognition; ASR; Aurora2; Euclidian distance measures; T146 database; VFR; automatic speech recognition; entropy; fixed-rate analysis; performance; speech processing algorithms; speech signals; variable frame rate analysis; Acoustic noise; Automatic speech recognition; Covariance matrix; Distributed computing; Entropy; Random variables; Signal analysis; Signal processing; Speech analysis; Speech processing;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326044
Filename
1326044
Link To Document