مرکز منطقه ای اطلاع رساني علوم و فناوري - Multimodal space representation driven by self-evaluation of predictability

DocumentCode :

186295

Title :

Multimodal space representation driven by self-evaluation of predictability

Author :

Lefort, M. ; Kopinski, Thomas ; Gepperth, Alexander

Author_Institution :

Comput. Sci. & Syst. Eng. Dept., ENSTA ParisTech, Palaiseau, France

fYear :

2014

fDate :

13-16 Oct. 2014

Firstpage :

319

Lastpage :

324

Abstract :

PROPRE is a generic and modular neural learning paradigm that autonomously extracts meaningful concepts of multimodal data flows driven by predictability across modalities in an unsupervised, incremental and online way. For that purpose, PROPRE consists of the combination of projection and prediction. Firstly, each data flow is topologically projected with a self-organizing map, largely inspired from the Kohonen model. Secondly, each projection is predicted by each other map activities, by mean of linear regressions. The main originality of PROPRE is the use of a simple and generic predictability measure that compares predicted and real activities for each modal stream. This measure drives the corresponding projection learning to favor the mapping of predictable stimuli across modalities at the system level (i.e. that their predictability measure overcomes some threshold). This predictability measure acts as a self-evaluation module that tends to bias the representations extracted by the system so that to improve their correlations across modalities. We already showed that this modulation mechanism is able to bootstrap representation extraction from previously learned representations with artificial multimodal data related to basic robotic behaviors [1] and improves performance of the system for classification of visual data within a supervised learning context [2]. In this article, we improve the self-evaluation module of PROPRE, by introducing a sliding threshold, and apply it to the unsupervised classification of gestures caught from two time-of-flight (ToF) cameras. In this context, we illustrate that the modulation mechanism is still useful although less efficient than purely supervised learning.

Keywords :

gesture recognition; image classification; learning (artificial intelligence); self-organising feature maps; Kohonen model; PROPRE; ToF cameras; artificial multimodal data; linear regressions; modular neural learning paradigm; modulation mechanism; multimodal data flows; multimodal space representation; predictability self-evaluation; predictable stimuli mapping; projection learning; representation extraction; robotic behaviors; self-evaluation module; self-organizing map; sliding threshold; supervised learning; two time-of-flight cameras; unsupervised gesture classification; visual data classification; Cameras; Computer architecture; Modulation; Robot sensing systems; Supervised learning; Unsupervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2014 Joint IEEE International Conferences on

Conference_Location :

Genoa

Type :

conf

DOI :

10.1109/DEVLRN.2014.6983000

Filename :

6983000

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=186295