DocumentCode :
748555
Title :
Multitapering and a wavelet variant of MFCC in speech recognition
Author :
Ricotti, L. Prina
Author_Institution :
Fondazione Ugo Bordoni, Roma, Italy
Volume :
152
Issue :
1
fYear :
2005
Firstpage :
29
Lastpage :
35
Abstract :
In speech recognition (ASR) based on hidden Markov models (HMM) it is necessary to obtain a spectral approximation with a reduced set of representation coefficients. The author introduces to the speech parameterisation scheme multitapering and a modification of the usual mel frequency cepstrum coefficients (MFCC) processing scheme based on wavelets on intervals (wavelet frequency coefficients, WFC). Phoneme recognition performance improvements compared to the MFCC have been experimentally verified on data from a speech database, using multitapering and WFC.
Keywords :
approximation theory; audio databases; hidden Markov models; signal representation; speech recognition; wavelet transforms; hidden Markov model; mel frequency cepstrum coefficient; spectral approximation; speech database; speech parameterisation scheme multitapering; speech recognition; wavelet frequency coefficient;
fLanguage :
English
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
Publisher :
iet
ISSN :
1350-245X
Type :
jour
DOI :
10.1049/ip-vis:20051004
Filename :
1408922
Link To Document :
بازگشت