DocumentCode
2633104
Title
A Particle Swarm Optimization-Based Approach to Speaker Segmentation Based on Independent Component Analysis on GSM Digital Speech
Author
Mirrezaie, S.M. ; Faez, Karim ; Asnaashari, Amir ; Ziaei, Ali
Author_Institution
Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
fYear
2008
fDate
16-19 Dec. 2008
Firstpage
502
Lastpage
507
Abstract
The Adaptive Multi-Rate (AMR) codec was standardized for GSM in 1999. AMR offers a substantial improvement in error robustness over previous GSM speech codecs by adapting speech and channel coding to channel conditions. The AMR speech codec has been adopted as a standard for IMT-2000 by ETSI and 3GPP and consists of eight source codecs with bit rates from 4.75 to 12.2 kbit/s. In this paper, we present an approach based on particle swarm optimization (PSO), which encodes possible segmentations of an audio record and measures the mutual information between these segments and the audio data. This measure is used as the fitness function for the PSO. A compact encoding of the solution is adopted, which decreases the length of the PSO individuals and improves the convergence properties of the PSO. The algorithm has been tested for speaker segmentation on two actual data sets in AMR format, obtaining very good results in all test problems. The results have been compared with those of a widely used genetic algorithm-based approach in several practical situations. No assumptions have been made about prior knowledge of speech signal characteristics. However, we assume that the speakers do not speak simultaneously and that there are no real-time constraints.
Keywords
adaptive codes; audio coding; cellular radio; channel coding; independent component analysis; particle swarm optimisation; speech codecs; speech coding; GSM digital speech codec; PSO convergence; adaptive multirate speech codec; audio record segmentation; channel coding; independent component analysis; particle swarm optimization; speaker segmentation; Channel coding; Code standards; GSM; Independent component analysis; Particle swarm optimization; Robustness; Speech analysis; Speech codecs; Speech coding; Testing; Adaptive multirate (AMR); genetic algorithm; mutual information; particle swarm optimization (PSO); speaker segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Information Technology, 2008. ISSPIT 2008. IEEE International Symposium on
Conference_Location
Sarajevo
Print_ISBN
978-1-4244-3554-8
Electronic_ISBN
978-1-4244-3555-5
Type
conf
DOI
10.1109/ISSPIT.2008.4775731
Filename
4775731