مرکز منطقه ای اطلاع رساني علوم و فناوري - A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort

DocumentCode :

3648284

Title :

A comparison of front-end compensation strategies for robust LVCSR under room reverberation and increased vocal effort

Author :

Seyed Omid Sadjadi;Hynek Bořil;John H.L. Hansen

Author_Institution :

Center for Robust Speech Systems (CRSS), The University of Texas at Dallas, USA

fYear :

2012

Firstpage :

4701

Lastpage :

4704

Abstract :

Automatic speech recognition is known to deteriorate in the presence of room reverberation and variation of vocal effort in speakers. This study considers robustness of several state-of-the-art front-end feature extraction and normalization strategies to these sources of speech signal variability in the context of large vocabulary continuous speech recognition (LVCSR). A speech database recorded in an anechoic room, capturing modal speech and speech produced at different levels of vocal effort, is reverberated using measured room impulse responses and utilized in the evaluations. It is shown that the combination of recently introduced mean Hilbert envelope coefficients (MHEC) and a normalization strategy combining cepstral gain normalization and modified RASTA filtering (CGN RASTALP) provides considerable recognition performance gains for reverberant modal and high vocal effort speech.

Keywords :

"Speech","Reverberation","Speech recognition","Mel frequency cepstral coefficient","Robustness","Feature extraction"

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Type :

conf

DOI :

10.1109/ICASSP.2012.6288968

Filename :

6288968

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3648284