مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method

DocumentCode :

865580

Title :

Robust Feature Extraction for Continuous Speech Recognition Using the MVDR Spectrum Estimation Method

Author :

Dharanipragada, Satya ; Yapanel, Umit H. ; Rao, Bhaskar D.

Author_Institution :

Citadel Investment Group, Chicago, IL

Volume :

Issue :

fYear :

2007

Firstpage :

224

Lastpage :

234

Abstract :

This paper describes a robust feature extraction technique for continuous speech recognition. Central to the technique is the minimum variance distortionless response (MVDR) method of spectrum estimation. We consider incorporating perceptual information in two ways: 1) after the MVDR power spectrum is computed and 2) directly during the MVDR spectrum estimation. We show that incorporating perceptual information directly into the spectrum estimation improves both robustness and computational efficiency significantly. We analyze the class separability and speaker variability properties of the features using a Fisher linear discriminant measure and show that these features provide better class separability and better suppression of speaker-dependent information than the widely used mel frequency cepstral coefficient (MFCC) features. We evaluate the technique on four different tasks: an in-car speech recognition task, the Aurora-2 matched task, the Wall Street Journal (WSJ) task, and the Switchboard task. The new feature extraction technique gives lower word-error-rates than the MFCC and perceptual linear prediction (PLP) feature extraction techniques in most cases. Statistical significance tests reveal that the improvement is most significant in high noise conditions. The technique thus provides improved robustness to noise without sacrificing performance in clean conditions

Keywords :

feature extraction; speech processing; speech recognition; statistical testing; Aurora-2 matched task; Fisher linear discriminant measure; MVDR spectrum estimation method; Switchboard task; Wall Street Journal task; continuous speech recognition; feature extraction techniques; in-car speech recognition task; mel frequency cepstral coefficient; minimum variance distortionless response; perceptual linear prediction; robust feature extraction; speaker-dependent information; Cepstral analysis; Computational efficiency; Feature extraction; Frequency measurement; Information analysis; Mel frequency cepstral coefficient; Robustness; Spectral analysis; Speech recognition; Testing; Distortionless response; minimum variance; robust feature extraction for continuous speech recognition; spectral analysis; speech analysis;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2006.876776

Filename :

4032769

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=865580