Regional Information Center for Science and Technology (RICeST) - High-dimensional linear representations for robust speech recognition

DocumentCode :

3635316

Title :

High-dimensional linear representations for robust speech recognition

Author :

Matthew Ager;Zoran Cvetković;Peter Sollich

Author_Institution :

Department of Mathematics, King's College London

Year :

2010

Firstpage :

Lastpage :

Abstract :

Phoneme classification is investigated in linear feature domains with the aim of improving the robustness to additive noise. Linear feature domains allow for exact noise adaptation and so should result in more accurate classification than representations involving nonlinear processing and dimensionality reduction. We develop a generative framework for phoneme classification using linear features. We first show results for a representation consisting of concatenated frames from the centre of the phoneme, each containing f frames. As no single f is optimal for all phonemes, we further average over models with a range of values of f. Next we improve results by including information from the entire phoneme. In the presence of additive noise, classification in this framework outperforms an analogous PLP classifier, adapted to noise using cepstral mean and variance normalisation, at SNRs below 18 dB.
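
The claim that linear feature domains allow exact noise adaptation can be illustrated with a minimal sketch. It assumes, purely for illustration, that each phoneme class is modelled by a single Gaussian over linear features and that the noise is independent, zero-mean, white Gaussian with known variance; the abstract does not state the paper's actual model, and the function names below are hypothetical. Under these assumptions the noisy observation y = x + n is again Gaussian, with the noise variance simply added to the clean covariance.

```python
import numpy as np

def adapt_gaussian_to_noise(mean, cov, noise_var):
    """Parameters of y = x + n for x ~ N(mean, cov), n ~ N(0, noise_var * I).

    Assumption (not from the source): white Gaussian noise, independent of x.
    The mean is unchanged and the covariances add, so the adaptation is exact.
    """
    dim = mean.shape[0]
    return mean.copy(), cov + noise_var * np.eye(dim)

def log_gaussian(y, mean, cov):
    """Log-density of a multivariate Gaussian evaluated at y."""
    dim = mean.shape[0]
    diff = y - mean
    _, logdet = np.linalg.slogdet(cov)
    maha = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (dim * np.log(2 * np.pi) + logdet + maha)

def classify(y, class_models, noise_var):
    """Pick the class whose noise-adapted Gaussian gives y the highest likelihood.

    class_models maps a label to a (mean, cov) pair estimated on clean data
    (hypothetical structure, equal class priors assumed).
    """
    scores = {}
    for label, (mean, cov) in class_models.items():
        adapted_mean, adapted_cov = adapt_gaussian_to_noise(mean, cov, noise_var)
        scores[label] = log_gaussian(y, adapted_mean, adapted_cov)
    return max(scores, key=scores.get)
```

A nonlinear representation such as PLP has no closed-form counterpart to the covariance update in `adapt_gaussian_to_noise`, which is why the abstract contrasts it with approximate schemes such as cepstral mean and variance normalisation.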

Keywords :

"Speech recognition","Additive noise","Noise robustness","Acoustic waves","Hidden Markov models","Acoustic noise","Automatic speech recognition","Cepstral analysis","Gaussian noise","Superluminescent diodes"

Publisher :

IEEE

Conference_Title :

Information Theory and Applications Workshop (ITA), 2010

Print_ISBN :

978-1-4244-7012-9

Type :

conf

DOI :

10.1109/ITA.2010.5454172

Filename :

5454172

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3635316