DocumentCode :
71078
Title :
Sound Source Localization for HRI Using FOC-Based Time Difference Feature and Spatial Grid Matching
Author :
Xiaofei Li ; Hong Liu
Author_Institution :
Key Lab. of Integrated Micro-Syst., Peking Univ., Shenzhen, China
Volume :
43
Issue :
4
fYear :
2013
fDate :
Aug. 2013
Firstpage :
1199
Lastpage :
1212
Abstract :
In human-robot interaction (HRI), speech sound source localization (SSL) is a convenient and efficient way to obtain the relative position between a speaker and a robot. However, implementing a SSL system based on TDOA method encounters many problems, such as noise of real environments, the solution of nonlinear equations, switch between far field and near field. In this paper, fourth-order cumulant spectrum is derived, based on which a time delay estimation (TDE) algorithm that is available for speech signal and immune to spatially correlated Gaussian noise is proposed. Furthermore, time difference feature of sound source and its spatial distribution are analyzed, and a spatial grid matching (SGM) algorithm is proposed for localization step, which handles some problems that geometric positioning method faces effectively. Valid feature detection algorithm and a decision tree method are also suggested to improve localization performance and reduce computational complexity. Experiments are carried out in real environments on a mobile robot platform, in which thousands of sets of speech data with noise collected by four microphones are tested in 3D space. The effectiveness of our TDE method and SGM algorithm is verified.
Keywords :
Gaussian noise; acoustic generators; acoustic signal processing; computational complexity; decision trees; delay estimation; feature extraction; geometry; higher order statistics; human-robot interaction; microphone arrays; mobile robots; path planning; position control; spectral analysis; speech processing; FOC-based time difference feature; HRI; SGM algorithm; SSL system; TDE algorithm; computational complexity reduction; decision tree method; far-field; feature detection algorithm; fourth-order cumulant spectrum; geometric positioning method; human-robot interaction; localization performance improvement; microphones; mobile robot platform; near-field; noise collection; nonlinear equation solution; spatial distribution analysis; spatial grid matching; spatial grid matching algorithm; spatially correlated Gaussian noise; speech data sets; speech signal; speech sound source localization; time delay estimation algorithm; time difference feature analysis; Arrays; Azimuth; Delay effects; Feature extraction; Microphones; Silicon; Speech; Fourth-order cumulant spectrum; human–robot interaction (HRI); spatial grid matching (SGM); spatially correlated Gaussian noise; speech sound source localization;
fLanguage :
English
Journal_Title :
Cybernetics, IEEE Transactions on
Publisher :
ieee
ISSN :
2168-2267
Type :
jour
DOI :
10.1109/TSMCB.2012.2226443
Filename :
6355814
Link To Document :
بازگشت