مرکز منطقه ای اطلاع رساني علوم و فناوري - Analysis and comparison of two speech feature extraction/compensation algorithms

DocumentCode :

806873

Title :

Analysis and comparison of two speech feature extraction/compensation algorithms

Author :

Deng, Li ; Wu, Jian ; Droppo, Jasha ; Acero, Alex

Author_Institution :

Microsoft Corp., Redmond, WA, USA

Volume :

Issue :

fYear :

2005

fDate :

6/1/2005 12:00:00 AM

Firstpage :

477

Lastpage :

480

Abstract :

Two feature extraction and compensation algorithms, feature-space minimum phone error (fMPE), which contributed to the recent significant progress in conversational speech recognition, and stereo-based piecewise linear compensation for environments (SPLICE), which has been used successfully in noise-robust speech recognition, are analyzed and compared. These two algorithms have been developed by very different motivations and been applied to very different speech-recognition tasks as well. While the mathematical construction of the two algorithms is ostensibly different, in this report, we establish a direct link between them. We show that both algorithms in the run-time operation accomplish feature extraction/compensation by adding a posterior-based weighted sum of "correction vectors," or equivalently the column vectors in the fMPE projection matrix, to the original, uncompensated features. Although the published fMPE algorithm empirically motivates such a feature extraction, operation as "a reasonable starting point for training" our analysis proves that it is a natural consequence of the rigorous minimum mean square error (MMSE) optimization rule as developed in SPLICE. Further, we review and compare related speech-recognition results with the use of fMPE and SPLICE algorithms. The results demonstrate the effectiveness of discriminative training on the feature extraction parameters (i.e., projection matrix in fMPE and equivalently correction vectors in SPLICE). The analysis and comparison of the two algorithms provide useful insight into the strong success of fMPE and point to further algorithm improvement and extension.

Keywords :

feature extraction; hidden Markov models; least mean squares methods; optimisation; piecewise linear techniques; speech recognition; MMSE; SPLICE; correction vectors; discriminative training; fMPE projection matrix; feature compensation algorithms; feature extraction; feature-space minimum phone error; hidden Markov model; minimum mean square error; optimization; speech recognition; stereo-based piecewise linear compensation for environment; Acoustic distortion; Algorithm design and analysis; Feature extraction; Mean square error methods; Noise robustness; Piecewise linear techniques; Runtime; Signal processing algorithms; Speech analysis; Speech recognition; Discriminative training; feature compensation; feature extraction; hidden Markov model; minimum classification error; minimum phone error; piecewise linear mapping; posterior probability; speech processing;

fLanguage :

English

Journal_Title :

Signal Processing Letters, IEEE

Publisher :

ieee

ISSN :

1070-9908

Type :

jour

DOI :

10.1109/LSP.2005.847861

Filename :

1430751

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=806873