DocumentCode :
2165459
Title :
Real time speaker localization and detection system for camera steering in multiparticipant videoconferencing environments
Author :
Marti, Amparo ; Cobos, Maximo ; Lopez, Jose J.
Author_Institution :
Instituto de Telecomunicaciones y Aplicaciones Multimedia (iTEAM), Universidad Politécnica de Valencia, Italy
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
2592
Lastpage :
2595
Abstract :
A real time speaker localization and detection system for videoconferencing environments is presented. In this system, a recently proposed modified Steered Response Power - Phase Transform (SRP-PHAT) algorithm has been used as the core processing scheme. The new SRP-PHAT functional has been shown to provide robust localization performance in indoor environments without the need for having a very fine spatial grid, thus reducing the computational cost required in a practical implementation. Moreover, it has been demonstrated that the statistical distribution of location estimates when a speaker is active can be successfully used to discriminate between speech and non-speech frames by using a criterion of peakedness. As a result, talking participants can be detected and located with significant accuracy following a common processing framework.
Keywords :
Array signal processing; Cameras; Microphone arrays; Robustness; Speech; Teleconferencing; SRP-PHAT; microphone arrays; source localization; speaker detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague, Czech Republic
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5947015
Filename :
5947015
Link To Document :
بازگشت