مرکز منطقه ای اطلاع رساني علوم و فناوري - Combined waveform-cepstral representation for robust speech recognition

DocumentCode :

3643615

Title :

Combined waveform-cepstral representation for robust speech recognition

Author :

Matthew Ager;Zoran Cvetković;Peter Sollich

Author_Institution :

Department of Mathematics, King´s College London, UK

fYear :

2011

fDate :

7/1/2011 12:00:00 AM

Firstpage :

864

Lastpage :

868

Abstract :

High-dimensional acoustic waveform representations are studied as a front-end for noise robust automatic speech recognition using generative methods, in particular Gaussian mixture models and hidden Markov models. The proposed representations are compared with standard cepstral features on phoneme classification and recognition tasks. While lower error rates are achieved using cepstral features at very low noise levels, the acoustic waveform representations are much more robust to noise. A convex combination of acoustic waveforms and cepstral features is then considered and it achieves higher accuracy than either of the individual representations across all noise levels.

Keywords :

"Speech recognition","Speech","Hidden Markov models","Noise","Mel frequency cepstral coefficient"

Publisher :

ieee

Conference_Titel :

Information Theory Proceedings (ISIT), 2011 IEEE International Symposium on

ISSN :

2157-8095

Print_ISBN :

978-1-4577-0596-0

Electronic_ISBN :

2157-8117

Type :

conf

DOI :

10.1109/ISIT.2011.6034260

Filename :

6034260

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3643615