DocumentCode :
3408539
Title :
Improvement of sector based multiple speaker localization in a smart room
Author :
Hesam, M. ; Marvi, H.
Author_Institution :
Dept. of Electron. & Robotic Eng., Shahrood Univ. of Technol., Shahrood, Iran
fYear :
2010
fDate :
24-28 Oct. 2010
Firstpage :
470
Lastpage :
473
Abstract :
Recent advances in computer technology and speech processing, together with growing interest in human-machine communication, have made it possible to develop hands-free speech applications that use microphone arrays in smart room environments. One of the most important tasks in a smart room is multi-speaker localization, which enables a wide range of applications. Source localization typically combines the hyperbolae produced by time delay estimation (TDE) between several microphone pairs. In this paper, we propose a new acoustic multi-speaker localization function, which we call OPROD-PHAT, that combines the TDEs by multiplying the spatial likelihood functions (SLFs) generated from each microphone pair and incorporates head orientation information. To reduce the search space, we divide the meeting room into a few sectors; for each time frame, we estimate the average output power of the OPROD-PHAT function within the volume of each sector and use a new two-step adaptive threshold to determine more reliably which sectors contain an active speaker. Finally, we apply a closed-form TDOA-based localization approach within each active sector, which shows that a single-speaker TDOA method can be applied to a multi-speaker problem. Simulation results show the superior performance of the proposed system.
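The pipeline described in the abstract (per-pair SLFs multiplied together, then averaged per sector and thresholded) can be illustrated with a minimal sketch. This is not the authors' OPROD-PHAT: the head orientation weighting is omitted, the fixed relative threshold is only a placeholder for the two-step adaptive threshold, and every name below (`gcc_phat`, `product_slf`, `sector_means`, `active_sectors`, the speed of sound `C`, the `ratio` parameter) is hypothetical. Clamping negative correlation values to zero before multiplying is likewise an assumption of this sketch.

```python
import numpy as np

C = 343.0  # assumed speed of sound in m/s


def gcc_phat(x1, x2, n_fft):
    """PHAT-weighted cross-correlation; index k holds lag k, negative lags wrap."""
    X1 = np.fft.rfft(x1, n_fft)
    X2 = np.fft.rfft(x2, n_fft)
    cross = X1 * np.conj(X2)
    cross /= np.abs(cross) + 1e-12  # PHAT: keep phase, discard magnitude
    return np.fft.irfft(cross, n_fft)


def product_slf(signals, mics, points, fs):
    """Multiply the per-pair SLFs evaluated at each candidate point.

    signals: (M, N) array, one row per microphone
    mics:    (M, D) microphone positions in metres
    points:  (P, D) candidate source positions in metres
    """
    n_fft = 2 * signals.shape[1]  # zero-pad to avoid circular aliasing
    slf = np.ones(len(points))
    for i in range(len(mics)):
        for j in range(i + 1, len(mics)):
            cc = gcc_phat(signals[i], signals[j], n_fft)
            # Expected TDOA of every candidate point for this pair.
            tdoa = (np.linalg.norm(points - mics[i], axis=1)
                    - np.linalg.norm(points - mics[j], axis=1)) / C
            lag = np.rint(tdoa * fs).astype(int)
            # Assumption: clamp negatives so the product stays non-negative.
            slf *= np.maximum(cc[lag % n_fft], 0.0)
    return slf


def sector_means(slf, sector_ids, n_sectors):
    """Average the SLF over the candidate points assigned to each sector."""
    sums = np.bincount(sector_ids, weights=slf, minlength=n_sectors)
    counts = np.bincount(sector_ids, minlength=n_sectors)
    return sums / np.maximum(counts, 1)


def active_sectors(means, ratio=0.5):
    """Placeholder rule (not the paper's two-step adaptive threshold)."""
    return means > ratio * means.max()
```

Each sector flagged as active could then be passed to a single-speaker TDOA localizer, which is the step the abstract describes last. The product is sharp in this sketch because each pair's SLF is sampled at a single rounded lag; a practical system would likely interpolate or smooth the correlation.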
Keywords :
loudspeakers; microphones; speech processing; OPROD-PHAT; acoustic multi-speaker localization function; computer technology; hands-free speech application; human-machine communication; hyperbolae; microphone array; microphones; sector based multiple speaker localization; smart room environments; source localization; spatial likelihood function; speech processing; time delay estimation; Acoustics; Arrays; Delay; Estimation; Microphones; Speech; Speech processing; microphone array; multiperson localization; time delay of arrival (TDOA); head orientation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Signal Processing (ICSP), 2010 IEEE 10th International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-5897-4
Type :
conf
DOI :
10.1109/ICOSP.2010.5656145
Filename :
5656145