DocumentCode :
2339144
Title :
Auditory and visual integration based localization and tracking of humans in daily-life environments
Author :
Hyun-Don Kim ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.
Author_Institution :
Kyoto Univ., Kyoto
fYear :
2007
fDate :
Oct. 29 2007-Nov. 2 2007
Firstpage :
2021
Lastpage :
2027
Abstract :
The purpose of this research is to develop techniques that enable robots to choose and track a desired person for interaction in daily-life environments. Therefore, localizing multiple moving sounds and human faces is necessary so that robots can locate a desired person. For sound source localization, we used a cross-power spectrum phase analysis (CSP) method and showed that CSP can localize sound sources only using two microphones and does not need impulse response data. An expectation-maximization (EM) algorithm was shown to enable a robot to cope with multiple moving sound sources. For face localization, we developed a method that can reliably detect several faces using the skin color classification obtained by using the EM algorithm. To deal with a change in color state according to illumination condition and various skin colors, the robot can obtain new skin color features of faces detected by OpenCV, an open vision library, for detecting human faces. Finally, we developed a probability based method to integrate auditory and visual information and to produce a reliable tracking path in real time. Furthermore, the developed system chose and tracked people while dealing with various background noises that are considered loud, even in the daily-life environments.
Keywords :
acoustic signal processing; expectation-maximisation algorithm; face recognition; feature extraction; image classification; image colour analysis; object detection; probability; robot vision; source separation; spectral analysis; target tracking; OpenCV; auditory information; auditory-visual integration based localization; cross-power spectrum phase analysis; daily-life environment; expectation-maximization algorithm; face localization; human face detection; human tracking; microphones; multiple moving sounds; open vision library; probability based method; robot; skin color classification; skin color features; sound source localization; visual information; Background noise; Change detection algorithms; Computer vision; Face detection; Humans; Libraries; Lighting; Microphones; Robot vision systems; Skin;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2007. IROS 2007. IEEE/RSJ International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-1-4244-0912-9
Electronic_ISBN :
978-1-4244-0912-9
Type :
conf
DOI :
10.1109/IROS.2007.4399331
Filename :
4399331
Link To Document :
بازگشت