DocumentCode :
2574763
Title :
Statistical models for speech dereverberation
Author :
Yoshioka, Takuya ; Kameoka, Hirokazu ; Nakatani, Tomohiro ; Okuno, Hiroshi G.
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
fYear :
2009
fDate :
18-21 Oct. 2009
Firstpage :
145
Lastpage :
148
Abstract :
This paper discusses a statistical-model-based approach to speech dereverberation. With this approach, we first define parametric statistical models of probability density functions (pdfs) for a clean speech signal and a room transmission channel, then estimate the model parameters, and finally recover the clean speech signal by using the pdfs with the estimated parameter values. The key to the success of this approach lies in the definition of the models of the clean speech signal and room transmission channel pdfs. This paper presents several statistical models (including newly proposed ones) and compares them in a large-scale experiment. As regards the room transmission channel pdf, an autoregressive (AR) model, an autoregressive power spectral density (ARPSD) model, and a moving-average power spectral density (MAPSD) model are considered. A clean speech signal pdf model is selected according to the room transmission channel pdf model. The AR model exhibited the highest dereverberation accuracy when a reverberant speech signal of 2 sec or longer was available while the other two models outperformed the AR model when only a 1-sec reverberant speech signal was available.
Keywords :
probability; reverberation; speech processing; autoregressive model; autoregressive power spectral density model; clean speech signal; moving-average power spectral density model; probability density functions; room transmission channel; speech dereverberation; statistical-model-based approach; Acoustics; Degradation; Frequency; Microphones; Noise reduction; Parameter estimation; Probability density function; Reverberation; Speech enhancement; Speech processing; Dereverberation; statistical model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09. IEEE Workshop on
Conference_Location :
New Paltz, NY
ISSN :
1931-1168
Print_ISBN :
978-1-4244-3678-1
Electronic_ISBN :
1931-1168
Type :
conf
DOI :
10.1109/ASPAA.2009.5346489
Filename :
5346489
Link To Document :
بازگشت