DocumentCode
2169915
Title
Alignment of binocular-binaural data using a moving audio-visual target
Author
Khalidov, Vasil ; Forbes, Florence ; Horaud, Radu
Author_Institution
INRIA Grenoble Rhone-Alpes, Montbonnot Saint-Martin, France
fYear
2013
fDate
Sept. 30 2013-Oct. 2 2013
Firstpage
242
Lastpage
247
Abstract
In this paper we address the problem of aligning visual (V) and auditory (A) data using a sensor that is composed of a camera-pair and a microphone-pair. The original contribution of the paper is a method for AV data aligning through estimation of the 3D positions of the microphones in the visual-centred coordinate frame defined by the stereo camera-pair. We exploit the fact that these two distinct data sets are conditioned by a common set of parameters, namely the (unknown) 3D trajectory of an AV object, and derive an EM-like algorithm that alternates between the estimation of the microphone-pair position and the estimation of the AV object trajectory. The proposed algorithm has a number of built-in features: it can deal with A and V observations that are misaligned in time, it estimates the reliability of the data, it is robust to outliers in both modalities, and it has proven theoretical convergence. We report experiments with both simulated and real data.
Keywords
audio-visual systems; cameras; expectation-maximisation algorithm; microphones; robot vision; stereo image processing; 3D positions; 3D trajectory; AV data aligning through estimation; AV object trajectory; EM-like algorithm; audio-visual target moving; binocular-binaural data alignment; humanoid robot Nao; microphone-pair estimation; microphone-pair position; stereo camera-pair; visual aligning problem; Calibration; Cameras; Microphones; Noise; Three-dimensional displays; Trajectory; Visualization;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing (MMSP), 2013 IEEE 15th International Workshop on
Conference_Location
Pula
Type
conf
DOI
10.1109/MMSP.2013.6659295
Filename
6659295
Link To Document