Title :
High-dimensional linear representations for robust speech recognition
Author :
Matthew Ager;Zoran Cvetković;Peter Sollich
Author_Institution :
Department of Mathematics, King´s College London
Abstract :
Phoneme classification is investigated in linear feature domains with the aim of improving the robustness to additive noise. Linear feature domains allow for exact noise adaptation and so should result in more accurate classification than representations involving nonlinear processing and dimensionality reduction. We develop a generative framework for phoneme classification using linear features. We first show results for a representation consisting of concatenated frames from the centre of the phoneme, each containing f frames. As no single f is optimal for all phonemes, we further average over models with a range of values of f. Next we improve results by including information from the entire phoneme. In the presence of additive noise, classification in this framework performs better than an analogous PLP classifier, adapted to noise using cepstral mean and variance normalisation, below 18dB SNR.
Keywords :
"Speech recognition","Additive noise","Noise robustness","Acoustic waves","Hidden Markov models","Acoustic noise","Automatic speech recognition","Cepstral analysis","Gaussian noise","Superluminescent diodes"
Conference_Titel :
Information Theory and Applications Workshop (ITA), 2010
Print_ISBN :
978-1-4244-7012-9
DOI :
10.1109/ITA.2010.5454172