• DocumentCode
    699805
  • Title

    Feature dimensionality reduction through Genetic Algorithms for faster speaker recognition

  • Author

    Zamalloa, M. ; Rodriguez-Fuentes, L.J. ; Penagarikano, M. ; Bordel, G. ; Uribe, J.P.

  • Author_Institution
    Grupo de Trabajo en Tecnol. del Software, Univ. of the Basque Country, Leioa, Spain
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Mel-Frequency Cepstral Coefficients and their derivatives are commonly used as acoustic features for speaker recognition. Reducing the number of features leads to more robust estimates of model parameters, and speeds up the classification task, which is crucial for real-time speaker recognition applications running on low-resource devices. In this paper, a feature selection procedure based on Genetic Algorithms (GA) is presented and compared to two well-known dimensionality reduction techniques, namely PCA and LDA. Evaluation is carried out for two speech databases, containing laboratory read speech and telephone spontaneous speech, applying a standard speaker recognition system. Results suggest that dynamic features are less discriminant than static ones, since the low-size optimal subsets found by the GA did not include dynamic features. GA-based feature selection outperformed PCA and LDA when dealing with clean speech, whereas PCA and LDA outperformed GA-based feature selection for telephone speech, probably due to some kind of noise compensation implicit in linear transforms, which cannot be accomplished just by selecting a subset of features.
  • Keywords
    feature selection; genetic algorithms; principal component analysis; real-time systems; speaker recognition; LDA; PCA; acoustic features; faster speaker recognition; feature dimensionality reduction; feature selection procedure; genetic algorithms; laboratory read speech; linear discriminant analysis; linear transforms; mel-frequency cepstral coefficients; noise compensation; principal component analysis; real-time speaker recognition; speech database; telephone spontaneous speech; Genetic algorithms; Mel frequency cepstral coefficient; Principal component analysis; Speaker recognition; Speech; Speech recognition; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080337