مرکز منطقه ای اطلاع رساني علوم و فناوري - Adapting noisy speech models

DocumentCode :

2800755

Title :

Adapting noisy speech models — Extended uncertainty decoding

Author :

Lu, Jianhua ; Ming, Ji ; Woods, Roger

Author_Institution :

Inst. of Electron., Commun. & Inf. Technol., Queen´´s Univ. Belfast, Belfast, UK

fYear :

2010

fDate :

14-19 March 2010

Firstpage :

4322

Lastpage :

4325

Abstract :

Most conventional techniques for noise adaptation assume a clean initial speech model which is adapted to a specific noise condition using adaptation data accumulated from the condition. In this paper, a different problem is considered, i.e. adapting a noisy speech model to a specific noise condition. For example, the initial noisy model may be a multi-condition model which is used to provide more accurate transcripts for the adaptation data than could be provided by a clean model, thereby obtaining a more accurate adaptation. We develop the formulation for this new problem by combining and extending maximum likelihood linear regression (MLLR), constrained MLLR (CMLLR) and uncertainty decoding techniques. We also present an implementation which has been tested on the Aurora 4 database, assuming an initial multi-condition model trained using white noise corrupted data. Significant word error rate (WER) reductions are achieved in comparison with other approaches.

Keywords :

decoding; maximum likelihood estimation; regression analysis; speech coding; Aurora 4 database; constrained maximum likelihood linear regression; extended uncertainty decoding; multicondition model; noise condition; noisy speech models; white noise corrupted data; word error rate reductions; Acoustic noise; Gaussian noise; Maximum likelihood decoding; Maximum likelihood linear regression; Noise robustness; Speech enhancement; Speech recognition; Transforms; Uncertainty; Working environment noise; Aurora 4; noise adaptation; noise compensation; noise robustness; speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location :

Dallas, TX

ISSN :

1520-6149

Print_ISBN :

978-1-4244-4295-9

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2010.5495660

Filename :

5495660

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2800755