DocumentCode
249142
Title
Pixelwise object class segmentation based on synthetic data using an optimized training strategy
Author
Dittrich, Frank ; Woern, Heinz ; Sharma, Vishal ; Yayilgan, Sule
Author_Institution
Inst. for Process Control & Robot, Karlsruher Inst. of Technol. (KIT), Karlsruhe, Germany
fYear
2014
fDate
19-20 Aug. 2014
Firstpage
388
Lastpage
394
Abstract
In this paper we present an approach for low-level body part segmentation based on RGB-D data. The RGB-D sensor is thereby placed at the ceiling and observes a shared workspace for human-robot collaboration in the industrial domain. The pixelwise information about certain body parts of the human worker is used by a cognitive system for the optimization of interaction and collaboration processes. In this context, for rational decision making and planning, the pixelwise predictions must be reliable despite the high variability of the appearance of the human worker. In our approach we treat the problem as a pixelwise classification task, where we train a random decision forest classifier on the information contained in depth frames produced by a synthetic representation of the human body and the ceiling sensor, in a virtual environment. As shown in similar approaches, the samples used for training need to cover a broad spectrum of the geometrical characteristics of the human, and possible transformations of the body in the scene. In order to reduce the number of training samples and the complexity of the classifier training, we therefore apply an elaborated and coupled strategy for randomized training data sampling and feature extraction. This allows us to reduce the training set size and training time, by decreasing the dimensionality of the sampling parameter space. In order to keep the creation of synthetic training samples and real-world ground truth data simple, we use a highly reduced virtual representation of the human body, in combination with KINECT skeleton tracking data from a calibrated multi-sensor setup. The optimized training and simplified sample creation allows us to deploy standard hardware for the realization of the presented approach, while yielding a reliable segmentation in real-time, and high performance scores in the evaluation.
Keywords
cognitive systems; feature extraction; image classification; image colour analysis; image representation; image segmentation; learning (artificial intelligence); sensor fusion; KINECT skeleton tracking data; RGB-D data; RGB-D sensor; classifier training; cognitive system; collaboration processes; feature extraction; human-robot collaboration; interaction processes; low-level body part segmentation; multisensor setup; optimized training strategy; pixelwise classification task; pixelwise information; pixelwise object class segmentation; random decision forest classifier; randomized training data sampling; sampling parameter space dimensionality reduction; synthetic data; synthetic human body representation; virtual environment; virtual human body representation; Feature extraction; Image segmentation; Labeling; Optimization; Robot sensing systems; Training; Vegetation;
fLanguage
English
Publisher
ieee
Conference_Titel
Networks & Soft Computing (ICNSC), 2014 First International Conference on
Conference_Location
Guntur
Print_ISBN
978-1-4799-3485-0
Type
conf
DOI
10.1109/CNSC.2014.6906671
Filename
6906671
Link To Document