DocumentCode :
3528123
Title :
Speaker diarization using unsupervised discriminant analysis of inter-channel delay features
Author :
Evans, Nicholas W D ; Fredouille, Corinne ; Bonastre, Jean-François
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4061
Lastpage :
4064
Abstract :
When multiple microphones are available, estimates of inter-channel delay, which characterise a speaker's location, can be used as features for speaker diarization. Background noise and reverberation can, however, lead to noisy features and poor performance. To ameliorate these problems, this paper presents a new approach to the discriminant analysis of delay features for speaker diarization. This novel and nonetheless unsupervised approach aims to increase speaker separability in delay-space. We assess the approach on subsets of four standard NIST RT datasets and demonstrate a relative improvement in diarization error rate of 25% on a separate evaluation set using delay features alone.
Keywords :
speaker recognition; NIST RT datasets; inter-channel delay estimates; inter-channel delay features; microphones; speaker diarization; speaker separability; unsupervised discriminant analysis; Background noise; Delay effects; Delay estimation; Feature extraction; Loudspeakers; Microphones; NIST; Reverberation; Speech; Viterbi algorithm; Speaker diarization; multiple distant microphones;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960520
Filename :
4960520