Title :
A Visual Silence Detector Constraining Speech Source Separation
Author :
Gonzalez, Isabel ; Ravyse, Ilse ; Brouckxon, Henk ; Verhelst, Werner ; Jiang, Dongmei ; Sahli, Hichem
Abstract :
We propose an audiovisual source separation algorithm for speech signals. The algorithm first extracts, from synchronous video recordings, the time segments with low activity of the mouth region. An automatically selected optimal classifier is used to detect silent intervals within these segments of low visual mouth activity. Then, the source separation problem is formulated and solved for the entire signal duration. The approach was tested on two challenging speech corpora with two speakers and two microphones: in the first corpus, separate source signals were mixed in a simulated room; the second corpus contains recorded conversations. The results are promising on both corpora: with the visual silence detector, the performance of the source separation algorithm, measured by the signal to noise inference ratio, increases.
Keywords :
audio-visual systems; source separation; speech processing; video recording; audiovisual source separation; low visual mouth activity; microphones; optimal classifier; speakers; speech corpora; speech signals; speech source separation; synchronous video recordings; visual silence detector; Detectors; Inference algorithms; Microphones; Mouth; Noise measurement; Signal to noise ratio; Source separation; Speech; Testing; Video recording;
Conference_Title :
Image and Graphics, 2009. ICIG '09. Fifth International Conference on
Conference_Location :
Xi'an, Shaanxi
Print_ISBN :
978-1-4244-5237-8
DOI :
10.1109/ICIG.2009.146