DocumentCode :
705373
Title :
Articulatory based speech models for blind speech dereverberation using sequential Monte Carlo methods
Author :
Evers, Christine ; Hopgood, James R.
Author_Institution :
Inst. for Digital Commun., Univ. of Edinburgh, Edinburgh, UK
fYear :
2010
fDate :
23-27 Aug. 2010
Firstpage :
2131
Lastpage :
2135
Abstract :
Room reverberation leads to reduced intelligibility of audio signals. Enhancement is thus crucial for high-quality audio and scene analysis applications. This paper proposes to directly and optimally estimate the source signal and acoustic channel from the distorted observations. The remaining model parameters are sampled from a particle filter, facilitating real-time dereverberation. The approach was previously successfully applied to single- and multisensor blind dereverberation. Enhancement can be improved upon by accurately modelling the speech production system. This paper therefore extends the blind dereverberation approach to incorporate a novel source model based on parallel formant synthesis and compares the approach to one using a time-varying AR model, with parameters varying according to a random walk. Experimental data shows that dereverberation using the proposed model is improved for vowels, stop consonants, and fricatives.
Keywords :
Monte Carlo methods; acoustic signal processing; audio signal processing; autoregressive processes; blind source separation; channel estimation; particle filtering (numerical methods); reverberation; signal sampling; speech enhancement; acoustic channel estimation; articulatory based speech models; audio signal intelligibility reduction; blind speech dereverberation; high-quality audio analysis application; high-quality scene analysis application; model parameter sampling; parallel formant synthesis; particle filter; random walk; room reverberation; sequential Monte Carlo methods; source signal estimation; speech production system; time-varying AR model; Approximation methods; Bandwidth; Channel estimation; Markov processes; Resonant frequency; Shape; Speech;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Signal Processing Conference, 2010 18th European
Conference_Location :
Aalborg
ISSN :
2219-5491
Type :
conf
Filename :
7096646