Title :
A Two-level Method for Unsupervised Speaker-based Audio Segmentation
Author :
Zhang, Shilei ; Zhang, Shuwu ; Xu, Bo
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing
Abstract :
In this paper, we propose a two-level segmentation method that effectively detects speaker changes in a continuous audio stream. The change detection process is divided into two levels: a region level, which detects potential change regions containing candidate speaker change points, and a boundary level, which searches for and refines the true change points. At the region level, we employ the modified generalized likelihood ratio (MGLR) metric to search for potential change regions in continuous local windows. At the boundary level, we apply the T2 and Bayesian information criterion (BIC) algorithms to detect segment boundaries within the potential windows. Experimental results on the 1997 Broadcast News Hub4-NE Mandarin corpus show the efficiency of the proposed scheme.
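The boundary-level BIC test can be illustrated with a short sketch. The abstract gives no formulas, so the code below assumes the commonly used delta-BIC formulation with one full-covariance Gaussian per segment; it does not cover the MGLR region search or the T2 step, and it is not claimed to be the authors' implementation. The names delta_bic, best_change_point, margin, and penalty_weight are hypothetical, chosen for illustration only.

```python
import numpy as np


def delta_bic(frames, split, penalty_weight=1.0):
    """Delta-BIC for splitting `frames` (n x d feature matrix) at row `split`.

    Assumed model: one full-covariance Gaussian for the whole window versus
    one Gaussian on each side of the split. A positive value favours a change.
    """
    n, d = frames.shape

    def logdet(x):
        # Maximum-likelihood covariance (bias=True); reshape handles d == 1.
        cov = np.cov(x, rowvar=False, bias=True).reshape(d, d)
        _, value = np.linalg.slogdet(cov)
        return value

    left, right = frames[:split], frames[split:]
    gain = 0.5 * (n * logdet(frames)
                  - len(left) * logdet(left)
                  - len(right) * logdet(right))
    # Extra parameters of the two-model hypothesis: one mean and one covariance.
    n_params = d + d * (d + 1) / 2
    return gain - penalty_weight * 0.5 * n_params * np.log(n)


def best_change_point(frames, margin=20, penalty_weight=1.0):
    """Return (index, score) of the best split, or (None, score) if no change.

    `margin` (an assumption of this sketch) keeps enough frames on each side
    to estimate a covariance matrix.
    """
    n = frames.shape[0]
    if n <= 2 * margin:
        raise ValueError("window too short for the chosen margin")
    candidates = list(range(margin, n - margin))
    scores = [delta_bic(frames, i, penalty_weight) for i in candidates]
    best = int(np.argmax(scores))
    if scores[best] > 0:
        return candidates[best], scores[best]
    return None, scores[best]
```

In a two-level scheme of the kind described, a function like best_change_point would be run only inside the windows flagged at the region level, which avoids evaluating the comparatively expensive BIC test over the whole stream.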
Keywords :
Bayes methods; audio signal processing; speaker recognition; Bayesian information criterion algorithm; T2 algorithm; continuous audio stream; modified generalized likelihood ratio; speaker change detection; unsupervised speaker-based audio segmentation; Automation; Bayesian methods; Broadcasting; Change detection algorithms; Decoding; Indexing; Robustness; Speech recognition; Statistics; Streaming media
Conference_Title :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.189