Title :
On the use of Early-To-Late Reverberation ratio for ASR in reverberant environments
Author :
Brutti, Alessio ; Matassoni, Marco
Author_Institution :
Fondazione Bruno Kessler, Center for Inf. & Commun. Technol., Trento, Italy
Abstract :
This work presents an analysis of distant-talking speech recognition in a variety of reverberant conditions, correlating ASR performance to the acoustic characteristics of a given propagation channel. In particular we show how, for a digit recognition task, the ASR accuracy is directly related to the Early-to-Late Reverberation ratio of the room impulse response, capturing in a single parameter the reverberation properties of a given channel independently of the setup. Consequently, this measure can be successfully considered for acoustic model training either selecting the most suitable model for a given spatial configuration, or defining the subset of RIRs to be used for the creation of multi-condition models. Experimental results on simulated data as well as on data generated with real impulse responses support our claims.
Keywords :
reverberation; speech recognition; transient response; ASR performance; acoustic characteristics; acoustic model training; digit recognition task; distant-talking speech recognition; early-to-late reverberation ratio; impulse responses; multicondition models; propagation channel; reverberant conditions; reverberant environments; reverberation properties; room impulse response; speech recognition; Accuracy; Microphones; Reverberation; Speech; Speech recognition; Training; direct-to-reverberant ratio; distant ASR; multi-condition training; reverberation; room impulse response;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854481