DocumentCode
3087673
Title
High performance sound source separation adaptable to environmental changes for robot audition
Author
Nakajima, Hirofumi ; Nakadai, Kazuhiro ; Hasegawa, Yuuji ; Tsujino, Hiroshi
Author_Institution
Honda Res. Inst. Japan Co., Ltd., Wako
fYear
2008
fDate
22-26 Sept. 2008
Firstpage
2165
Lastpage
2171
Abstract
This paper describes a novel sound source separation method for a robot that needs to cope with dynamically changing noises in the real world. The sound source separation method, Geometric Source Separation (GSS), is promising because it has high separation performance and requires low computational cost. One of the most important factors in GSS performance is a step-size parameter to update a separation matrix which is generally used for extracting a target sound source. A fixed value that was obtained empirically is commonly used as the step-size parameter. However, in the real world, the surrounding environment changes dynamically. Thus, conventional GSS with a fixed step-size parameter sometimes results in poor separation results, or divergence of the separation matrix. Another important factor is the weight parameter, which adjusts the balance between geometric errors and separation errors and also affects performance. If this parameter is set to a small value, GSS becomes similar to a Blind Source Separation method, by which the output signal may contain errors based on indefinite source amplitudes and orders. In contrast, if this parameter is set to a large value, GSS becomes similar to a delay-and-sum Beamforming method, which does not have high separation performance. GSS gives good performance when the parameters are tuned to an optimum value, which changes according to the environment. We propose two effective methods that can be used for general BSSpsilas. One is an adaptive step-size parameter control method. By using this method, the step-size and the weight parameters are automatically set to optimum values and are able to adapt to environmental changes. The other is an optima controlled recursive average method for correlation matrix estimation. This method can improve the estimation of a separation matrix, and achieve high separation performance. We evaluated the proposed GSS algorithm with an 8ch microphone array embedded in Honda ASIMO. Experimen- al results showed that the proposed method improved sound source separation even in dynamically changing environments.
Keywords
audio signal processing; correlation methods; optimal control; robots; source separation; adaptive step-size parameter control; correlation matrix estimation; geometric errors; geometric source separation; high performance sound source separation; optima controlled recursive average method; robot audition; separation errors; separation matrix; Correlation; Cost function; Microphones; Optimized production technology; Robots; Source separation; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
Conference_Location
Nice
Print_ISBN
978-1-4244-2057-5
Type
conf
DOI
10.1109/IROS.2008.4650597
Filename
4650597
Link To Document