مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speech recognition using dynamic noise adaptation

DocumentCode :

2174570

Title :

Robust speech recognition using dynamic noise adaptation

Author :

Rennie, Steven ; Dognin, Pierre ; Fousek, Petr

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

4592

Lastpage :

4595

Abstract :

Dynamic noise adaptation (DNA) is a model-based technique for improving automatic speech recognition (ASR) performance in noise. DNA has shown promise on artificially mixed data such as the Aurora II and DNA+Aurora II tasks - significantly outperforming well-known techniques like the ETSI AFE and fMLLR - but has never been tried on real data. In this paper, we present new results generated by commercial-grade ASR systems trained on large amounts of data. We show that DNA improves upon the performance of the spectral subtraction (SS) and stochastic fMLLR algorithms of our embedded recognizers, particularly in unseen noise conditions, and describe how DNA has been evolved to become suitable for deployment in low-latency ASR systems. DNA improves our best embedded system, which utilizes SS, fMLLR, and fMPE by over 22% relative at SNRs below 6 dB, reducing the word error rate in these adverse conditions from 4.24% to 3.29%.

Keywords :

noise; speech recognition; stochastic processes; DNA+Aurora II task; ETSI AFE technique; SS algorithm; dynamic noise adaptation; fMLLR technique; robust ASR; robust automatic speech recognition; spectral subtraction algorithm; stochastic fMLLR algorithm; word error rate reduction; Adaptation models; DNA; Hidden Markov models; Signal to noise ratio; Speech; Speech recognition; Algonquin; DNA + Aurora II; Dynamic Noise Adaptation (DNA); ETSI AFE; Vector Taylor Series (VTS); fMLLR; fMPE; model adaptation; robust speech recognition (ASR); spectral subtraction;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947377

Filename :

5947377

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2174570