DocumentCode
3336973
Title
Automatic Image-to-Text-to-Voice Conversion for Interactively Locating Objects in Home Environments
Author
Bourbakis, Nikolaos
Author_Institution
ATRC, WSU, OH
Volume
2
fYear
2008
fDate
3-5 Nov. 2008
Firstpage
49
Lastpage
55
Abstract
The efficient processing and association of different multi-modal information is a very important research field with a great variety of applications, such as human computer interaction, knowledge discovery, document understanding, etc. A good approach to this important issue is the development of a common platform for converting different modalities (such as images, text, etc) into the same medium and associating them for efficient processing and understanding. Thus, this paper here presents the development of a novel methodology based on Local-Global (LG) graphs capable for automatically converting image context into natural language text sentences and then into speech for serving as an interactive model for locating missing objects in home environments. Simple illustrative examples are provided for proving the concept proposed here.
Keywords
graph theory; home computing; interactive systems; natural language processing; object recognition; speech synthesis; automatic image-to-text-to-voice conversion; home environment; interactive object location; local-global graph; natural language text sentence; Artificial intelligence; Data mining; Feature extraction; Image converters; Image edge detection; Image processing; Image recognition; Image retrieval; Image segmentation; Natural languages; Converting Images to NL-Text; Graphs; Image Analysis and Representation; Recognizing Objects;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2008. ICTAI '08. 20th IEEE International Conference on
Conference_Location
Dayton, OH
ISSN
1082-3409
Print_ISBN
978-0-7695-3440-4
Type
conf
DOI
10.1109/ICTAI.2008.123
Filename
4669755
Link To Document