Title :
OuluVS2: A multi-view audiovisual database for non-rigid mouth motion analysis
Author :
Anina, Iryna ; Zhou, Ziheng ; Zhao, Guoying ; Pietikäinen, Matti
Author_Institution :
Center for Machine Vision Research, University of Oulu, Oulu, Finland
Abstract :
Visual speech constitutes a large part of our non-rigid facial motion and contains important information that allows machines to interact with human users, for instance through automatic visual speech recognition (VSR) and speaker verification. One of the major obstacles to research on non-rigid mouth motion analysis is the absence of suitable databases. Those available for public research either lack a sufficient number of speakers or utterances or cover only constrained viewpoints, which limits their representativeness and usefulness. This paper introduces a newly collected multi-view audiovisual database for non-rigid mouth motion analysis. It includes more than 50 speakers uttering three types of utterances and, more importantly, thousands of videos recorded simultaneously by six cameras from five different views spanning the range from frontal to profile. Moreover, a simple VSR system has been developed and tested on the database to provide baseline performance.
Keywords :
cameras; image motion analysis; sign language recognition; speaker recognition; OuluVS2; VSR system; automatic visual speech recognition; multiview audiovisual database; nonrigid facial mouth motion analysis; speaker verification; Databases; Mouth; Speech; Synchronization; Videos; Visualization
Conference_Title :
2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG)
Conference_Location :
Ljubljana
DOI :
10.1109/FG.2015.7163155