مرکز منطقه ای اطلاع رساني علوم و فناوري - Compensation of nonlinear distortions in speech for automatic recognition

DocumentCode :

3670739

Title :

Compensation of nonlinear distortions in speech for automatic recognition

Author :

Jiri Malek;Jan Silovsky;Petr Cerva;Zbynek Koldovsky;Jan Nouza;Jindrich Zdansky

Author_Institution :

Faculty of Mechatronics, Informatics, and Interdisciplinary Studies, Technical University of Liberec, Studentská

fYear :

2015

fDate :

7/1/2015 12:00:00 AM

Firstpage :

Lastpage :

Abstract :

This paper addresses improvement of automatic transcription of speech distorted already during recording or by consequent processing. We focus on distortions that cannot be represented by most often used models, that is, as an additive noise or a linear convolutive channel distortion. We consider a) signals distorted through overgained microphone preamplifier and b) recordings exhibiting unnatural spectral sparseness, caused by application of excessive denoising or low-bit-rate compression. We demonstrate that these distortions deteriorate ASR accuracy significantly. To compensate, we propose to employ a combination of two general robust speech recognition techniques: a front-end feature normalization method and a channel/speaker adaptation technique. We present a significant improvement of transcription accuracy in the case of lectures distorted during recording, compressed broadcast data and utterances recorded with an inappropriately applied denoising.

Keywords :

"Nonlinear distortion","Speech","Acoustic distortion","Accuracy","Silicon","Speech recognition"

Publisher :

ieee

Conference_Titel :

Telecommunications and Signal Processing (TSP), 2015 38th International Conference on

Type :

conf

DOI :

10.1109/TSP.2015.7296378

Filename :

7296378

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3670739