DocumentCode :
3709865
Title :
Audio-visual beat tracking based on a state-space model for a music robot dancing with humans
Author :
Misato Ohkita;Yoshiaki Bando;Yukara Ikemiya;Katsutoshi Itoyama;Kazuyoshi Yoshii
Author_Institution :
Graduate School of Informatics, Kyoto University, Sakyo-ku, 606-8501, Japan
fYear :
2015
fDate :
9/1/2015 12:00:00 AM
Firstpage :
5555
Lastpage :
5560
Abstract :
This paper presents an audio-visual beat-tracking method for an entertainment robot that can dance in synchronization with music and human dancers. Conventional music robots have focused on either music audio signals or dancing movements of humans for detecting and predicting beat times in real time. Since a robot needs to record music audio signals by using its own microphones, however, the signals are severely contaminated with loud environmental noise and reverberant sounds. Moreover, it is difficult to visually detect beat times from real complicated dancing movements that exhibit weaker repetitive characteristics than music audio signals do. To solve these problems, we propose a state-space model that integrates both audio and visual information in a probabilistic manner. At each frame, the method extracts acoustic features (audio tempos and onset likelihoods) from music audio signals and extracts skeleton features from movements of a human dancer. The current tempo and the next beat time are then estimated from those observed features by using a particle filter. Experimental results showed that the proposed multi-modal method using a depth sensor (Kinect) for extracting skeleton features outperformed conventional mono-modal methods by 0.20 (F measure) in terms of beat-tracking accuracy in a noisy and reverberant environment.
Keywords :
"Feature extraction","Robots","Skeleton","Multiple signal classification","Visualization","Acoustics","State-space methods"
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on
Type :
conf
DOI :
10.1109/IROS.2015.7354164
Filename :
7354164
Link To Document :
بازگشت