DocumentCode :
3723736
Title :
Impact of each camera on multiple camera visual speech recognizer using ANOVA: A brief study
Author :
Astik Biswas;P.K. Sahu;Mahesh Chandra
Author_Institution :
Dept of Electrical Engineering, National Institute of Technology, Rourkela, India-769008
fYear :
2015
Firstpage :
1
Lastpage :
5
Abstract :
Multiple camera fusion technique is an imperative part of multi-camera computer vision applications. Visual modality plays a vital role in computer vision systems when the acoustic modality is corrupted by the background noise. Multiple camera protocol allows the user to move freely and can provide complementary information to each other. This study shows the influence of each camera on visual speech recognizer using the one-way analysis of variance (ANOVA). We choose a real world four cameras audio-visual corpus “AVICAR” for this study. ANOVA is applied to the each pair of the camera to explore the effect of different viewing angle. This ANOVA test shows the influence of side and central faced camera on AVICAR visual speech recognizer (VSR). Based on the ANOVA F-statistics test multiple camera streams are fused into one visual feature vector. Dynamic visual speech information is captured using Motion History Images (MHI). Zernike Moments (ZM) are used as the visual feature to carry out the study. Four camera visual features show ample improvement over single camera visual features across all driving condition.
Keywords :
"Cameras","Speech"
Publisher :
ieee
Conference_Titel :
TENCON 2015 - 2015 IEEE Region 10 Conference
ISSN :
2159-3442
Print_ISBN :
978-1-4799-8639-2
Electronic_ISBN :
2159-3450
Type :
conf
DOI :
10.1109/TENCON.2015.7372979
Filename :
7372979
Link To Document :
بازگشت