Title :
Speaker diarization for multi-party meetings using acoustic fusion
Author :
Anguera, Xavier ; Woofers, C. ; Hernando, Javier
Author_Institution :
International Comput. Sci. Inst., Berkeley, CA
Abstract :
One of the sub-tasks of the Spring 2004 and Spring 2005 NIST Meetings evaluations requires segmenting multi-party meetings into speaker-homogeneous regions using data from multiple distant microphones (the "MDM" sub-task). One approach to this task is to run a speaker segmentation system on each of the microphone channels separately, and then merge the results. This can be thought of as a many-to-one post-processing approach. In this paper we propose an alternative approach in which we use delay-and-sum beamforming techniques to fuse the signals from each of the multiple distant microphones into a single enhanced signal. This approach can be thought of a many-to-one pre-processing approach. In the pre-processing approach we propose, the time delay of arrival (TDOA) between each of the multiple distant channels and a reference channel is computed incrementally using a window that steps through the signals from each of the multiple microphones. No information about the locations or setup of the microphones is required. Using the TDOA information, the channels are first aligned and then summed and the resulting "enhanced" signal is clustered using our standard speaker diarization system. We test our approach on the 2004 and 2005 NIST meetings evaluation databases and show that the technique performs very well
Keywords :
microphones; speaker recognition; acoustic fusion; beamforming techniques; microphone channels; multi-party meetings; multiple distant channels; multiple distant microphones; speaker diarization; speaker segmentation system; speaker-homogeneous regions; time delay of arrival; Array signal processing; Databases; Delay effects; Fuses; Loudspeakers; Microphones; NIST; Performance evaluation; Springs; Testing;
Conference_Titel :
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-7803-9478-X
Electronic_ISBN :
0-7803-9479-8
DOI :
10.1109/ASRU.2005.1566478