DocumentCode :
2769242
Title :
Efficient combination of parametric spaces, models and metrics for speaker diarization1
Author :
Stafylakis, Themos ; Katsouros, Vassilis ; Carayannis, George
Author_Institution :
Inst. for Language & Speech Process., Athens
fYear :
2007
fDate :
9-13 Dec. 2007
Firstpage :
256
Lastpage :
261
Abstract :
In this paper we present a method of combining several acoustic parametric spaces, statistical models and distance metrics in speaker diarization task. Focusing our interest on the post-segmentation part of the problem, we adopt an incremental feature selection and fusion algorithm based on the Maximum Entropy Principle and Iterative Scaling Algorithm that combines several statistical distance measures on speech-chunk pairs. By this approach, we place the merging-of-chunks clustering process into a probabilistic framework. We also propose a decomposition of the input space according to gender, recording conditions and chunk lengths. The algorithm produced highly competitive results compared to GMM-UBM state-of-the-art methods.
Keywords :
iterative methods; meta data; speaker recognition; statistical analysis; iterative scaling algorithm; maximum entropy principle; parametric space; speaker diarization task; speech-chunk pair; Bandwidth; Clustering algorithms; Entropy; Extraterrestrial measurements; Iterative algorithms; Iterative methods; Loudspeakers; Natural languages; Space technology; Speech; Fusion; Maximum Entropy; Single and Multimedia Indexing; Speaker Diarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Automatic Speech Recognition & Understanding, 2007. ASRU. IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-1746-9
Electronic_ISBN :
978-1-4244-1746-9
Type :
conf
DOI :
10.1109/ASRU.2007.4430120
Filename :
4430120
Link To Document :
بازگشت