DocumentCode :
394309
Title :
Online speaker clustering
Author :
Liu, Daben ; Kubala, Francis
Author_Institution :
BBN Technol., Cambridge, MA, USA
Volume :
1
fYear :
2003
fDate :
6-10 April 2003
Abstract :
This paper describes a set of new algorithms that perform speaker clustering in an online fashion. Unlike typical clustering approaches, the proposed method does not require the presence of all the data before performing clustering. The clustering decision is made as soon as an audio segment is received. Being causal, this method enables low-latency incremental speaker adaptation in online speech-to-text systems. It also gives a speaker tracking and indexing system the ability to label speakers with cluster ID on the fly. We show that the new online speaker clustering method yields better performance compared to the traditional hierarchical speaker clustering. Evaluation metrics for speaker clustering are also discussed.
Keywords :
pattern clustering; speech recognition; speech synthesis; audio segment; cluster ID; clustering decision; low-latency incremental speaker adaptation; online speaker clustering; online speech-to-text systems; speaker clustering algorithms; speaker indexing system; speaker tracking system; speech segments identification; Clustering algorithms; Clustering methods; Computational complexity; Gaussian distribution; Indexing; Labeling; Organizing; Speech recognition; Time factors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-7663-3
Type :
conf
DOI :
10.1109/ICASSP.2003.1198845
Filename :
1198845
Link To Document :
بازگشت