مرکز منطقه ای اطلاع رساني علوم و فناوري - Pixelwise object class segmentation based on synthetic data using an optimized training strategy

DocumentCode :

249142

Title :

Pixelwise object class segmentation based on synthetic data using an optimized training strategy

Author :

Dittrich, Frank ; Woern, Heinz ; Sharma, Vishal ; Yayilgan, Sule

Author_Institution :

Inst. for Process Control & Robot, Karlsruher Inst. of Technol. (KIT), Karlsruhe, Germany

fYear :

2014

fDate :

19-20 Aug. 2014

Firstpage :

388

Lastpage :

394

Abstract :

In this paper we present an approach for low-level body part segmentation based on RGB-D data. The RGB-D sensor is thereby placed at the ceiling and observes a shared workspace for human-robot collaboration in the industrial domain. The pixelwise information about certain body parts of the human worker is used by a cognitive system for the optimization of interaction and collaboration processes. In this context, for rational decision making and planning, the pixelwise predictions must be reliable despite the high variability of the appearance of the human worker. In our approach we treat the problem as a pixelwise classification task, where we train a random decision forest classifier on the information contained in depth frames produced by a synthetic representation of the human body and the ceiling sensor, in a virtual environment. As shown in similar approaches, the samples used for training need to cover a broad spectrum of the geometrical characteristics of the human, and possible transformations of the body in the scene. In order to reduce the number of training samples and the complexity of the classifier training, we therefore apply an elaborated and coupled strategy for randomized training data sampling and feature extraction. This allows us to reduce the training set size and training time, by decreasing the dimensionality of the sampling parameter space. In order to keep the creation of synthetic training samples and real-world ground truth data simple, we use a highly reduced virtual representation of the human body, in combination with KINECT skeleton tracking data from a calibrated multi-sensor setup. The optimized training and simplified sample creation allows us to deploy standard hardware for the realization of the presented approach, while yielding a reliable segmentation in real-time, and high performance scores in the evaluation.

Keywords :

cognitive systems; feature extraction; image classification; image colour analysis; image representation; image segmentation; learning (artificial intelligence); sensor fusion; KINECT skeleton tracking data; RGB-D data; RGB-D sensor; classifier training; cognitive system; collaboration processes; feature extraction; human-robot collaboration; interaction processes; low-level body part segmentation; multisensor setup; optimized training strategy; pixelwise classification task; pixelwise information; pixelwise object class segmentation; random decision forest classifier; randomized training data sampling; sampling parameter space dimensionality reduction; synthetic data; synthetic human body representation; virtual environment; virtual human body representation; Feature extraction; Image segmentation; Labeling; Optimization; Robot sensing systems; Training; Vegetation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Networks & Soft Computing (ICNSC), 2014 First International Conference on

Conference_Location :

Guntur

Print_ISBN :

978-1-4799-3485-0

Type :

conf

DOI :

10.1109/CNSC.2014.6906671

Filename :

6906671

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=249142