DocumentCode
417165
Title
Online speaker clustering
Author
Lilt, D. ; Kubala, Francis
Author_Institution
BBN Technol., Cambridge, MA, USA
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
The paper describes a set of new algorithms that perform speaker clustering in an online fashion. Unlike typical clustering approaches, the proposed method does not require the presence of all the data before performing clustering. The clustering decision is made as soon as an audio segment is received. Being causal, this method enables low-latency incremental speaker adaptation in online speech-to-text systems. It also gives a speaker tracking and indexing system the ability to label speakers with cluster ID on the fly. We show that the new online speaker clustering method yields better performance compared to the traditional hierarchical speaker clustering. Evaluation metrics for speaker clustering are also discussed.
Keywords
speaker recognition; audio indexing; audio segment; online speaker clustering; online speech-to-text systems; speaker adaptation; speaker indexing; speaker tracking; speech recognition; Clustering algorithms; Clustering methods; Computational complexity; Gaussian distribution; Indexing; Labeling; Organizing; Speech recognition; Time factors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1325990
Filename
1325990
Link To Document