DocumentCode :
1661942
Title :
Faster BIC segmentation using local speaker modeling
Author :
Travadi, Ruchir ; Saha, Goutam
Author_Institution :
Dept. of Electron. & Electr. Commun. Eng., Indian Inst. of Technol., Kharagpur, India
fYear :
2012
Firstpage :
1
Lastpage :
5
Abstract :
Segmentation is typically the most computationally expensive step involved in majority of speaker diarization systems. Bayesian Information Criterion (BIC) is a very widely adopted method for segmentation of audio data. While BIC returns fairly good results in terms of segmentation performance, it suffers from the problem of enormous complexity. Moreover, BIC based diarization systems encounter the worst case complexity when there is no change point in the input audio stream at all. Many audio streams contain fairly large segments separated by a very few change points. In such cases, it becomes impractical to employ BIC segmentation because of its complexity. In this paper, we have proposed a modification to the baseline BIC segmentation scheme, which makes use of local search information to reduce the overall complexity of the segmentation procedure. The results have been tested on several audio streams from broadcast news and the diarization runtime has been found to get reduced by a factor of 3.45, with a marginally better segmentation performance.
Keywords :
Bayes methods; audio signal processing; speaker recognition; Bayesian information criterion; audio data segmentation; audio streams; baseline BIC segmentation scheme; broadcast news; change point; diarization runtime; input audio stream; local search information; local speaker modeling; overall complexity; segmentation performance; segmentation procedure; speaker diarization systems; worst case complexity; Bayesian methods; Cepstral analysis; Complexity theory; Data models; Runtime; Speech; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Communications (NCC), 2012 National Conference on
Conference_Location :
Kharagpur
Print_ISBN :
978-1-4673-0815-1
Type :
conf
DOI :
10.1109/NCC.2012.6176884
Filename :
6176884
Link To Document :
بازگشت