DocumentCode
304822
Title
Talking about 3D scenes: integration of image and speech understanding in a hybrid distributed system
Author
Socher, Gudrun ; Sagerer, Gerhard ; Kummert, Franz ; Fuhr, Thomas
Author_Institution
Bielefeld Univ., Germany
Volume
1
fYear
1996
fDate
16-19 Sep 1996
Firstpage
809
Abstract
We present a hybrid system that integrates speech and image understanding. Given spoken references, it is able to identify objects of a 3D scene perceived via a stereo camera. Central to our approach is the extraction of qualitative object features and spatial scene properties from acoustic and visual data. The interaction of the understanding processes is performed using a procedural semantic network that interfaces with signal recognition and reconstruction modules, thus integrating semantic, neural and Bayesian networks and Hidden Markov Models.
Keywords
Bayes methods; computer vision; feature extraction; hidden Markov models; neural nets; semantic networks; speech recognition; visual databases; 3D scenes; Bayesian networks; Hidden Markov Models; hybrid distributed system; image understanding; procedural semantic network; qualitative object features extraction; semantic nets; spatial scene properties; speech understanding; spoken references; stereo camera; Cameras; Cognitive science; Data mining; Humans; Image reconstruction; Knowledge representation; Layout; Signal processing; Speech; Stereo vision
fLanguage
English
Publisher
IEEE
Conference_Titel
Image Processing, 1996. Proceedings., International Conference on
Conference_Location
Lausanne
Print_ISBN
0-7803-3259-8
Type
conf
DOI
10.1109/ICIP.1996.561028
Filename
561028