DocumentCode :
186295
Title :
Multimodal space representation driven by self-evaluation of predictability
Author :
Lefort, M. ; Kopinski, Thomas ; Gepperth, Alexander
Author_Institution :
Comput. Sci. & Syst. Eng. Dept., ENSTA ParisTech, Palaiseau, France
fYear :
2014
fDate :
13-16 Oct. 2014
Firstpage :
319
Lastpage :
324
Abstract :
PROPRE is a generic and modular neural learning paradigm that autonomously extracts meaningful concepts of multimodal data flows driven by predictability across modalities in an unsupervised, incremental and online way. For that purpose, PROPRE consists of the combination of projection and prediction. Firstly, each data flow is topologically projected with a self-organizing map, largely inspired from the Kohonen model. Secondly, each projection is predicted by each other map activities, by mean of linear regressions. The main originality of PROPRE is the use of a simple and generic predictability measure that compares predicted and real activities for each modal stream. This measure drives the corresponding projection learning to favor the mapping of predictable stimuli across modalities at the system level (i.e. that their predictability measure overcomes some threshold). This predictability measure acts as a self-evaluation module that tends to bias the representations extracted by the system so that to improve their correlations across modalities. We already showed that this modulation mechanism is able to bootstrap representation extraction from previously learned representations with artificial multimodal data related to basic robotic behaviors [1] and improves performance of the system for classification of visual data within a supervised learning context [2]. In this article, we improve the self-evaluation module of PROPRE, by introducing a sliding threshold, and apply it to the unsupervised classification of gestures caught from two time-of-flight (ToF) cameras. In this context, we illustrate that the modulation mechanism is still useful although less efficient than purely supervised learning.
Keywords :
gesture recognition; image classification; learning (artificial intelligence); self-organising feature maps; Kohonen model; PROPRE; ToF cameras; artificial multimodal data; linear regressions; modular neural learning paradigm; modulation mechanism; multimodal data flows; multimodal space representation; predictability self-evaluation; predictable stimuli mapping; projection learning; representation extraction; robotic behaviors; self-evaluation module; self-organizing map; sliding threshold; supervised learning; two time-of-flight cameras; unsupervised gesture classification; visual data classification; Cameras; Computer architecture; Modulation; Robot sensing systems; Supervised learning; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2014 Joint IEEE International Conferences on
Conference_Location :
Genoa
Type :
conf
DOI :
10.1109/DEVLRN.2014.6983000
Filename :
6983000
Link To Document :
بازگشت