DocumentCode :
3415080
Title :
Challenges in deploying a microphone array to localize and separate sound sources in real auditory scenes
Author :
Bando, Yoshiaki ; Otsuka, Takuma ; Itoyama, Katsutoshi ; Yoshii, Kazuyoshi ; Sasaki, Yoko ; Kagami, Satoshi ; Okuno, Hiroshi G.
Author_Institution :
Kyoto Univ., Kyoto, Japan
fYear :
2015
fDate :
19-24 April 2015
Firstpage :
723
Lastpage :
727
Abstract :
Analyzing the auditory scene of real environments is challenging partly because an unknown number and type of sound sources are observed at the same time and partly because these sounds are observed on a significantly different sound pressure level at the microphone. These are difficult problems even with state-of-the-art sound source localization and separation methods. In this paper, we exploit two such methods using a microphone array: (1) Bayesian nonparametric microphone array processing (BNP-MAP), which is capable of separating and localizing sound sources when the number of sound sources is unspecified, and (2) robot audition software “HARK” is capable of separating and localizing in real time. Through experimentation, we found that BNP-MAP is more robust against differences in the sound pressure levels of the source signals and in the spatial closeness of source positions. Experiments analyzing real scenes of human conversations recorded in a big exhibition hall and bird calling recorded at a natural park demonstrate the efficacy and applicability of BNP-MAP.
Keywords :
microphone arrays; BNP-MAP; Bayesian nonparametric microphone array processing; real auditory scenes; real environments; separate sound sources; sound pressure level; source positions; source signals; spatial closeness; Array signal processing; Arrays; Mobile communication; Quality function deployment; Signal to noise ratio; Time-frequency analysis; Auditory scene analysis; Bayesian nonparametrics; simultaneous sound source localization and separation; sounds of different volume; unknown time-varying number of sources;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
Type :
conf
DOI :
10.1109/ICASSP.2015.7178064
Filename :
7178064
Link To Document :
بازگشت