DocumentCode
2456614
Title
Unsupervised Speaker Clustering in a Linear Discriminant Subspace
Author
Giannakopoulos, Theodoros ; Petridis, Sergios
Author_Institution
Comput. Intell. Lab., NCSR Demokritos, Athens, Greece
fYear
2010
fDate
12-14 Dec. 2010
Firstpage
1005
Lastpage
1009
Abstract
We present an approach for grouping single-speaker speech segments into speaker-specific clusters. Our approach is based on applying the K-means clustering algorithm to a suitable discriminant subspace, where the euclidean distance reflect speaker differences. A core feature of our approach is approximating speaker-conditional statistics, that are not available, with single-speaker segments statistics, which can be evaluated, thus making possible to apply the LDA algorithm for finding the optimal discriminative subspace, using unlabeled data. To illustrate our method, we present examples of clusters generated by our approach when applied to the ICMLA 2010 Speaker Clustering Challenge datasets.
Keywords
approximation theory; pattern clustering; speech processing; statistical analysis; ICMLA 2010 Speaker Clustering Challenge datasets; K-means clustering algorithm; euclidean distance; linear discriminant subspace; optimal discriminative subspace; single-speaker speech segment grouping; speaker-conditional statistics approximation; speaker-specific clustering; unlabeled data; unsupervised speaker clustering; Algorithm design and analysis; Approximation methods; Clustering algorithms; Covariance matrix; Feature extraction; Linear discriminant analysis; Speech; K-means; linear discriminant analysis; speaker clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
Conference_Location
Washington, DC
Print_ISBN
978-1-4244-9211-4
Type
conf
DOI
10.1109/ICMLA.2010.159
Filename
5708985
Link To Document