Title :
Joint Optimization of the Frequency-Domain and Time-Domain Transformations in Deriving Generalized Static and Dynamic MFCCs
Author :
Lai, Yiu-Pong ; Siu, Manhung ; Mak, Brian
Author_Institution :
Dept. of Electr. & Electron. Eng., Hong Kong Univ. of Sci. & Technol.
Abstract :
Traditionally, static mel-frequency cepstral coefficients (MFCCs) are derived by discrete cosine transformation (DCT), and dynamic MFCCs are derived by linear regression. Their derivation may be generalized as a frequency-domain transformation of the log filter-bank energies (FBEs) followed by a time-domain transformation. In the past, these two transformations are usually estimated or optimized separately. In this letter, we consider sequences of log FBEs as a set of spectrogram images and investigate an image compression technique to jointly optimize the two transformations so that the reconstruction error of the spectrogram images is minimized; there is an efficient algorithm that solves the optimization problem. The framework allows extension to other optimization costs as well
Keywords :
cepstral analysis; channel bank filters; data compression; discrete cosine transforms; image coding; image reconstruction; minimisation; regression analysis; time-frequency analysis; DCT; discrete cosine transformation; dynamic MFCC; frequency-domain transformation; image compression technique; joint optimization; linear regression; log filter-bank; mel-frequency cepstral coefficient; spectrogram image reconstruction; time-domain transformation; Automatic speech recognition; Cepstral analysis; Discrete cosine transforms; Filtering; Linear regression; Mel frequency cepstral coefficient; Robustness; Spectrogram; Time domain analysis; Time frequency analysis; Discrete cosine transform (DCT); low-rank approximation of matrices; mel-frequency cepstral coefficients (MFCCs); time-frequency (TF) representation;
Journal_Title :
Signal Processing Letters, IEEE
DOI :
10.1109/LSP.2006.879452