• DocumentCode
    699445
  • Title

    Learning Vector Quantization and Neural Predictive Coding for nonlinear speech feature extraction

  • Author

    Chetouani, M. ; Gas, B. ; Zarader, J.L.

  • Author_Institution
    Lab. des Instrum. et Syst. d´Ile-De-France, Univ. Paris VI, Paris, France
  • fYear
    2004
  • fDate
    6-10 Sept. 2004
  • Firstpage
    2059
  • Lastpage
    2062
  • Abstract
    Speech recognition is a special field of pattern recognition. In order to improve the performances of the systems, one can opt for several ways and among them the design of a feature extractor. This paper presents a new nonlinear feature extraction method based on the Learning Vector Quantization (LVQ) and the Neural Predictive Coding (NPC). The key idea of this work is to design a feature extractor, the NPC, by the introduction of discriminant constraint provided by the LVQ classifier. The performances are estimated on a phoneme classification task by several methods: GMM, MLP, LVQ. The phonemes are extracted from the NTIMIT database. We make comparisons with linear and nonlinear feature transformation methods (LDA, PCA, NLDA, NPCA), and also with coding methods (LPC, MFCC, PLP).
  • Keywords
    Gaussian processes; feature extraction; mixture models; signal classification; speech coding; vector quantisation; GMM; Gaussians mixture model; LVQ classifier; NTIMIT database; learning vector quantization; neural predictive coding; nonlinear feature transformation method; nonlinear speech feature extraction method; phoneme classification task; Abstracts; Adaptation models; Databases; Loss measurement; Mel frequency cepstral coefficient; Predictive models; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2004 12th European
  • Conference_Location
    Vienna
  • Print_ISBN
    978-320-0001-65-7
  • Type

    conf

  • Filename
    7079975