مرکز منطقه ای اطلاع رساني علوم و فناوري - Multitapering and a wavelet variant of MFCC in speech recognition

DocumentCode :

748555

Title :

Multitapering and a wavelet variant of MFCC in speech recognition

Author :

Ricotti, L. Prina

Author_Institution :

Fondazione Ugo Bordoni, Roma, Italy

Volume :

152

Issue :

fYear :

2005

Firstpage :

Lastpage :

Abstract :

In speech recognition (ASR) based on hidden Markov models (HMM) it is necessary to obtain a spectral approximation with a reduced set of representation coefficients. The author introduces to the speech parameterisation scheme multitapering and a modification of the usual mel frequency cepstrum coefficients (MFCC) processing scheme based on wavelets on intervals (wavelet frequency coefficients, WFC). Phoneme recognition performance improvements compared to the MFCC have been experimentally verified on data from a speech database, using multitapering and WFC.

Keywords :

approximation theory; audio databases; hidden Markov models; signal representation; speech recognition; wavelet transforms; hidden Markov model; mel frequency cepstrum coefficient; spectral approximation; speech database; speech parameterisation scheme multitapering; speech recognition; wavelet frequency coefficient;

fLanguage :

English

Journal_Title :

Vision, Image and Signal Processing, IEE Proceedings -

Publisher :

iet

ISSN :

1350-245X

Type :

jour

DOI :

10.1049/ip-vis:20051004

Filename :

1408922

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=748555