Title :
Visually-Derived Wiener Filters for Speech Enhancement
Author :
Almajai, I. ; Milner, B. ; Darch, J. ; Vaseghi, Saeed
Author_Institution :
Sch. of Comput. Sci., East Anglia Univ., Norwich, UK
Abstract :
This work begins by examining the correlation between audio and visual speech features and reveals higher correlation to exist within individual phoneme sounds rather than globally across all speech. Utilising this correlation, a visually-derived Wiener filter is proposed in which clean power spectrum estimates are obtained from visual speech features. Two methods of extracting clean power spectrum estimates are made; first from a global estimate using a single Gaussian mixture model (GMM), and second from phoneme-specific estimates using a hidden Markov model (HMM)-GMM structure. Measurement of estimation accuracy reveals that the phoneme-specific (HMM-GMM) system leads to lower estimation errors than the global (GMM) system. Finally, the effectiveness of visually-derived Wiener filtering is examined.
Keywords :
Gaussian processes; Wiener filters; audio signal processing; hidden Markov models; image processing; speech enhancement; audio speech features; clean power spectrum estimates; hidden Markov model; phoneme-specific HMM-GMM system; single Gaussian mixture model; speech enhancement; visual speech features; visually-derived Wiener filters; Active appearance model; Filter bank; Hidden Markov models; Lips; Mouth; Shape control; Speech enhancement; Tongue; Vectors; Wiener filter; Audio-visual; GMM; HMM; Wiener filter; speech enhancement;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.366980