Title :
Multitapering and a wavelet variant of MFCC in speech recognition
Author :
Ricotti, L. Prina
Author_Institution :
Fondazione Ugo Bordoni, Roma, Italy
Abstract :
In speech recognition (ASR) based on hidden Markov models (HMM) it is necessary to obtain a spectral approximation with a reduced set of representation coefficients. The author introduces to the speech parameterisation scheme multitapering and a modification of the usual mel frequency cepstrum coefficients (MFCC) processing scheme based on wavelets on intervals (wavelet frequency coefficients, WFC). Phoneme recognition performance improvements compared to the MFCC have been experimentally verified on data from a speech database, using multitapering and WFC.
Keywords :
approximation theory; audio databases; hidden Markov models; signal representation; speech recognition; wavelet transforms; hidden Markov model; mel frequency cepstrum coefficient; spectral approximation; speech database; speech parameterisation scheme multitapering; speech recognition; wavelet frequency coefficient;
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
DOI :
10.1049/ip-vis:20051004