مرکز منطقه ای اطلاع رساني علوم و فناوري - Model-based dereverberation of speech in the mel-spectral domain

DocumentCode :

2080436

Title :

Model-based dereverberation of speech in the mel-spectral domain

Author :

Sehr, Armin ; Kellermann, Walter

Author_Institution :

Multimedia Commun. & Signal Process., Univ. of Erlangen-Nuremberg, Erlangen

fYear :

2008

fDate :

26-29 Oct. 2008

Firstpage :

783

Lastpage :

787

Abstract :

A model-based dereverberation approach for robust distant-talking speech recognition employing the powerful acoustic model of the recognizer to describe the clean speech feature sequence is discussed. The clean speech model is combined with a statistical reverberation model describing the acoustic path between speaker and microphone directly in the mel-spectral domain. Dereverberation is performed during recognition by determining the most likely contributions of the combined model´s components to the current reverberant feature vector. The advantages of processing feature-domain representations of speech rather than using time- or frequency-domain speech representations are the dimension reduction and the possibility to obtain robust reverberation models valid for arbitrary speaker and microphone positions in the recording room. In this contribution, we emphasize that the criterion used for the dereverberation operation is equivalent to maximum a posteriori estimation. Connected-digit recognition experiments confirm the superior performance of the novel concept.

Keywords :

speech recognition; statistical analysis; distant-talking speech recognition; mel-spectral domain; model-based dereverberation; speech feature sequence; speech representations; statistical reverberation; Automatic speech recognition; Hidden Markov models; Humans; Loudspeakers; Microphones; Multimedia communication; Reverberation; Robustness; Speech processing; Speech recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Signals, Systems and Computers, 2008 42nd Asilomar Conference on

Conference_Location :

Pacific Grove, CA

ISSN :

1058-6393

Print_ISBN :

978-1-4244-2940-0

Electronic_ISBN :

1058-6393

Type :

conf

DOI :

10.1109/ACSSC.2008.5074516

Filename :

5074516

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2080436