DocumentCode :
3429977
Title :
Speech Activity Detection with Lip Movement Image Signals
Author :
Lee, Soo-jong ; Park, Jun ; Kim, Eung-Kyeu
Author_Institution :
ETRI, Daejeon
fYear :
2007
fDate :
22-24 Aug. 2007
Firstpage :
403
Lastpage :
406
Abstract :
This paper describes an attempt to correlate visual lip-movement information, acquired via a camera, with speech audio acquired via a microphone from the same human speaker, so that audio produced by external noise is not misrecognized as speech from that speaker. Images of the speaker's face are captured with a PC camera and separated into those that show lip movement and those that do not. The resulting lip-movement image signal data is saved in shared memory, where it is available to the speech recognition process. The speech activity detection process, a pre-processing step of sound recognition, analyzes this data. We combined a speech recognition processor with an image recognizer, and the interworking function operated successfully at a rate of 99.3%.
Keywords :
computer vision; image motion analysis; image recognition; object recognition; speech recognition; PC camera; face images; human speaker; image recognition; lip movement image signals; lip movement visual information; microphone; sound recognition; speech activity detection; speech audio information; speech recognition; Acoustic noise; Cameras; Data analysis; Face; Humans; Microphones; Signal processing; Speech analysis; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Communications, Computers and Signal Processing, 2007. PacRim 2007. IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
978-1-4244-1189-4
Electronic_ISBN :
1-4244-1190-4
Type :
conf
DOI :
10.1109/PACRIM.2007.4313259
Filename :
4313259