DocumentCode :
506748
Title :
An unsupervised scheme for speaker indexing of audio databases
Author :
Chen, Yanxiang ; Liu, Ming
Author_Institution :
Coll. of Comput. Sci. & Inf., Hefei Univ. of Technol., Hefei, China
Volume :
3
fYear :
2009
fDate :
20-22 Nov. 2009
Firstpage :
90
Lastpage :
93
Abstract :
Speaker indexing of an audio database consists in organizing the audio data according to the speakers present in the database. This paper investigates on segmenting and clustering continuous audio streams automatically by speaker with no prior speaker model. It is composed of two steps: (1) segmentation based on GLR distance measure and BIC refinement, (2) clustering based on agglomerative clustering and pruning selection. The aim is to produce just one pure cluster for every speaker. Results are presented using the data sets derived from the Switchboard corpus and the effectiveness of the proposed scheme is shown.
Keywords :
audio databases; pattern clustering; unsupervised learning; BIC refinement; Bayesian information criterion; GLR distance measurement; Switchboard corpus; agglomerative clustering; audio databases; audio streams clustering; audio streams segmentation; generalized likelihood ratio; pruning selection; speaker indexing; unsupervised indexing scheme; Audio databases; Computer science; Data engineering; Educational institutions; Indexing; Merging; Organizing; Speech; Streaming media; Time measurement; BIC refinement; GLR distance measure; pruning selection; speaker indexing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-4754-1
Electronic_ISBN :
978-1-4244-4738-1
Type :
conf
DOI :
10.1109/ICICISYS.2009.5358240
Filename :
5358240
Link To Document :
بازگشت