DocumentCode :
3849572
Title :
Online speaker segmentation and clustering using cross-likelihood ratio calculation with reference criterion selection
Author :
M. Grasic;M. Kos;Z. Kacic
Author_Institution :
Faculty of Electrical Engineering and Computer Science, University of Maribor, Maribor, Slovenia
Volume :
4
Issue :
6
fYear :
2010
Firstpage :
673
Lastpage :
685
Abstract :
This study presents a new online method for speaker segmentation and clustering in real-world environments. It analyses and discusses the difficulties of online speaker diarisation and proposes a new segmentation and clustering method, in which the Bayesian information criterion (BIC) and the normalised cross-likelihood ratio (NCLR) are combined into an online speaker diarisation system. A new decision parameter for BIC and NCLR is proposed using normalisation with reference criterion selection (NRCS), together with a window normalisation technique called window-length compensation (WLC), which normalises the criterion value according to analysed window length. The effectiveness of the proposed system and techniques in comparison to the standard offline speaker diarisation system (mClust) is demonstrated on the Slovenian Broadcast News database (BNSI) and an English Broadcast News database (the HUB-4). The online system presented in this study achieves similar performance to the BIC-based offline approach.
Journal_Title :
IET Signal Processing
Publisher :
iet
ISSN :
1751-9675
Type :
jour
DOI :
10.1049/iet-spr.2009.0235
Filename :
5665899
Link To Document :
بازگشت