Regional Information Center for Science and Technology - How visual salience influences natural language descriptions

DocumentCode :

2738184

Title :

How visual salience influences natural language descriptions

Author :

Maass, Wolfgang

Author_Institution :

Saarbrucken Univ., Germany

fYear :

1995

fDate :

15 May 1995

Firstpage :

3/1

Lastpage :

3/3

Abstract :

The model presented is part of a larger project called VITRA (visual translator), in which we investigate aspects of the interaction of vision and language. Of particular interest is the information flow from the analysis of visual data to language generation. We focus on how visual information can be used for grounding descriptions in the environment. In cooperation with the visual perception group of the IIFB at the Fraunhofer Institute at the University of Karlsruhe, we have shown how real-world visual data in dynamic environments can be used in natural language descriptions. A model-based approach is used for automatically generating 3D representations of the environment. Our current work addresses problems that occur when an agent moves through real or synthetic environments. The agent's task during its movement is to incrementally describe a route from a starting point to a destination by referring to visually obtained objects. In general, the whole complexity of AI research is involved, e.g. control laws for movement, early vision processing, high-level vision, naive physics, temporal and spatial reasoning, knowledge representation, planning and language processing. In a first approach, we have implemented a software agent called MOSES, which describes a path in a synthetic 3D environment. MOSES can refer only to visually obtained objects (landmarks) in the current situation. Information about the path is extracted from a map by using an incremental path finding procedure.

Keywords :

knowledge representation; natural language interfaces; natural languages; path planning; spatial reasoning; visual perception; 3D representations; AI research; MOSES; VITRA; dynamic environments; early vision processing; high level vision; information flow; language generation; language processing; model based approach; naive physics; natural language descriptions; real world visual data; software agent; synthetic environments; visual information; visual salience; visual translator; visually obtained objects

fLanguage :

English

Publisher :

IET

Conference_Title :

Grounding Representations: Integration of Sensory Information in Natural Language Processing, Artificial Intelligence and Neural Networks, IEE Colloquium on

Conference_Location :

London

Type :

conf

DOI :

10.1049/ic:19950663

Filename :

478375

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2738184