DocumentCode :
565608
Title :
The cocktail party robot: Sound source separation and localisation with an active binaural head
Author :
Deleforge, Antoine ; Horaud, Radu
Author_Institution :
INRIA Grenoble Rhone-Alpes, Montbonnot, France
fYear :
2012
fDate :
5-8 March 2012
Firstpage :
431
Lastpage :
438
Abstract :
Human-robot communication is often faced with the difficult problem of interpreting ambiguous auditory data. For example, the acoustic signals perceived by a humanoid with its on-board microphones contain a mix of sounds such as speech, music, electronic devices, all in the presence of attenuation and reverberations. In this paper we propose a novel method, based on a generative probabilistic model and on active binaural hearing, allowing a robot to robustly perform sound-source separation and localization. We show how interaural spectral cues can be used within a constrained mixture model specifically designed to capture the richness of the data gathered with two microphones mounted onto a human-like artificial head. We describe in detail a novel EM algorithm, we analyse its initialization, speed of convergence and complexity, and we assess its performance with both simulated and real data.
Keywords :
acoustic signal processing; computational complexity; expectation-maximisation algorithm; microphones; probability; robots; source separation; EM algorithm; active binaural head; active binaural hearing; ambiguous auditory data interpretation; cocktail party robot; constrained mixture model; generative probabilistic model; human-like artificial head; human-robot communication; interaural spectral cues; microphones; sound source localisation; sound source separation; Acoustics; Microphones; Robots; Source separation; Spectrogram; Speech; USA Councils; Blind source separation; EM algorithm; computational auditory scene analysis; learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Human-Robot Interaction (HRI), 2012 7th ACM/IEEE International Conference on
Conference_Location :
Boston, MA
ISSN :
2167-2121
Print_ISBN :
978-1-4503-1063-5
Electronic_ISBN :
2167-2121
Type :
conf
Filename :
6249602
Link To Document :
بازگشت