Title :
Offline speaker segmentation using genetic algorithms and mutual information
Author :
Salcedo-Sanz, Sancho ; Gallardo-Antolín, Ascensión ; Leiva-Murillo, José Miguel ; Bousono-Calzón, Carlos
Author_Institution :
Dept. of Signal Theor. & Commun., Univ. de Alcala, Madrid, Spain
fDate :
4/1/2006 12:00:00 AM
Abstract :
We present an evolutionary approach to speaker segmentation, an activity that is especially important prior to speaker recognition and audio content analysis tasks. Our approach consists of a genetic algorithm (GA), which encodes possible segmentations of an audio record, and a measure of mutual information between the audio data and possible segmentations, which is used as fitness function for the GA. We introduce a compact encoding of the problem into the GA which reduces the length of the GA individuals and improves the GA convergence properties. Our algorithm has been tested on the segmentation of real audio data, and its performance has been compared with several existing algorithms for speaker segmentation, obtaining very good results in all test problems.
Keywords :
convergence; genetic algorithms; speaker recognition; GA convergence properties; audio content analysis tasks; audio record; compact encoding; evolutionary approach; fitness function; genetic algorithms; mutual information; offline speaker segmentation; speaker recognition; Audio databases; Genetic algorithms; Hidden Markov models; Image segmentation; Indexing; Information retrieval; Mutual information; TV; Testing; Unsupervised learning; Genetic algorithms (GAs); mutual information; speaker segmentation; unsupervised learning;
Journal_Title :
Evolutionary Computation, IEEE Transactions on
DOI :
10.1109/TEVC.2005.857079