• DocumentCode
    2633104
  • Title

    A Particle Swarm Optimization-Based Approach to Speaker Segmentation Based on Independent Component Analysis on GSM Digital Speech

  • Author

    Mirrezaie, S.M. ; Faez, Karim ; Asnaashari, Amir ; Ziaei, Ali

  • Author_Institution
    Dept. of Electr. Eng., Amirkabir Univ. of Technol., Tehran
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    502
  • Lastpage
    507
  • Abstract
    The Adaptive Multi-Rate (AMR) codec was standardized for GSM in 1999. AMR offers a substantial improvement in error robustness over previous GSM speech codecs by adapting speech and channel coding to channel conditions. The AMR speech codec has been adopted as a standard for IMT-2000 by ETSI and 3GPP and consists of eight source codecs with bit rates from 4.75 to 12.2 kbit/s. In this paper, we present an approach based on particle swarm optimization (PSO) that encodes possible segmentations of an audio record and measures the mutual information between these segments and the audio data. This measure is used as the fitness function for the PSO. A compact encoding of the solution is adopted, which decreases the length of the PSO individuals and enhances the PSO convergence properties. The algorithm has been tested for speaker segmentation on two actual sets of data in AMR format, obtaining very good results in all test problems. The results have been compared with those of a widely used genetic algorithm-based approach in several practical situations. No assumptions have been made about prior knowledge of speech signal characteristics. However, we assume that the speakers do not speak simultaneously and that there are no real-time constraints.
  • Keywords
    adaptive codes; audio coding; cellular radio; channel coding; independent component analysis; particle swarm optimisation; speech codecs; speech coding; GSM digital speech codec; PSO convergence; adaptive multirate speech codec; audio record segmentation; particle swarm optimization; speaker segmentation; Code standards; GSM; Robustness; Speech analysis; Testing; Adaptive multirate (AMR); genetic algorithm; mutual information; particle swarm optimization (PSO)
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing and Information Technology, 2008. ISSPIT 2008. IEEE International Symposium on
  • Conference_Location
    Sarajevo
  • Print_ISBN
    978-1-4244-3554-8
  • Electronic_ISBN
    978-1-4244-3555-5
  • Type

    conf

  • DOI
    10.1109/ISSPIT.2008.4775731
  • Filename
    4775731