Title :
Generating a variety of expressions from visual information and user-designated viewpoints
Author :
Noguchi, Y. ; Kondo, Makoto ; Kogure, Satoru ; Konishi, Tsuyoshi ; Itoh, Yoshio ; Takagi, A. ; Asoh, Hideki ; Kobayashi, Ichiro
Author_Institution :
Shizuoka Univ., Hamamatsu, Japan
Abstract :
This paper reports the development and evaluation of a natural language generation system that produces a variety of language expressions from visual information captured by a CCD camera. The distinguishing feature of the system is that it generates these expressions by combining different syntactic structures with different sets of vocabulary, while managing the generation process according to user-designated viewpoints. The system converts the visual information into a concept dependency structure using a semantic representation framework proposed by Takagi and Itoh. It then transforms the structure and divides it into a set of words, deriving a word dependency structure, which is later arranged into a sentence. The transformation of the concept dependency structure and the variation in word segmentation allow the system to generate a variety of sentences from the same visual information. In this paper, we apply user-designated viewpoints to scenes containing more than one object. We designed parameters for the user-designated viewpoints that enable the system to manage the generation process and to generate a variety of expressions. An evaluation confirmed that the system generates specific variations according to the parameter values set by the user. The variations include expressions referring to attribute values of the objects in the scenes and relative expressions denoting the relations between the targeted object and the others.
Keywords :
CCD image sensors; natural language processing; word processing; CCD camera; concept dependency structure; language expression generation; natural language generation system; semantic representation framework; syntactic structure combination; user-designated viewpoint; visual information; visual scene; vocabulary; word dependency structure; word segmentation; Educational institutions; Image color analysis; Natural languages; Prototypes; Semantics; Standards; Visualization; natural language generation; relative expressions; viewpoints
Conference_Title :
Awareness Science and Technology and Ubi-Media Computing (iCAST-UMEDIA), 2013 International Joint Conference on
Conference_Location :
Aizuwakamatsu
DOI :
10.1109/ICAwST.2013.6765459