• DocumentCode
    671523
  • Title

    Speaker recognition based on SOINN and incremental learning Gaussian mixture model

  • Author

    Zelin Tang ; Furao Shen ; Jinxi Zhao

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Nanjing Univ., Nanjing, China
  • fYear
    2013
  • fDate
    4-9 Aug. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Gaussian Mixture Models has been widely used in speaker recognition during the last decades. To deal with the dynamic growth of datasets, initial clustering problem and achieving the results of clustering effectively on incremental data, an incremental adaptation method called incremental learning Gaussian mixture model (IGMM) is proposed in this paper. It was applied to speaker recognition system based on Self Organization Incremental Learning Neural Network (SOINN) and improved EM algorithm. SOINN is a Neural Network which can reach a suitable mixture number and appropriate initial cluster for each model. First, the initial training is conducted by SOINN and EM algorithm only need a limited amount of data. Then, the model would adapt to the data available in each session to enrich itself incrementally and recursively. Experiments were taken on the 1st speech separation challenge database. The results show that IGMM outperforms GMM and classical Bayesian adaptation in most of the cases.
  • Keywords
    Gaussian processes; expectation-maximisation algorithm; learning (artificial intelligence); pattern clustering; self-organising feature maps; speaker recognition; EM algorithm; IGMM; SOINN; incremental adaptation method; incremental data clustering; incremental learning Gaussian mixture model; initial clustering problem; maximum likelihood estimation; speaker recognition system; Clustering algorithms; Gaussian mixture model; Mel frequency cepstral coefficient; Speaker recognition; Speech; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks (IJCNN), The 2013 International Joint Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    2161-4393
  • Print_ISBN
    978-1-4673-6128-6
  • Type

    conf

  • DOI
    10.1109/IJCNN.2013.6706863
  • Filename
    6706863