• DocumentCode
    547721
  • Title

    Comparison of linear based feature transformations to improve speech recognition performance

  • Author

    Shekofteh, Yasser ; Almasganj, Farshad ; Goodarzi, Mohammad Mohsen

  • Author_Institution
    Biomedical Engineering Faculty, Amirkabir University of Technology, Iran
  • fYear
    2011
  • fDate
    17-19 May 2011
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In automatic speech recognition system a diagonal GMM based CDHMM modeling is commonly used. So there is a need to use reasonable feature transformation to decorrelate input feature vectors to satisfy diagonal GMM assumption. In this paper, we introduce the utilization of the several supervised linear feature transformation in speech recognition tasks. Specially each of these methods has particular projection properties. We show that the proposed OLPP based feature transformation method with preserving local properties of feature vectors in the projected space has the best performance based on our experiment on Persian speech database FARSDAT. Also we has introduced a novel class labeling method to use the supervised feature transformation. Overall system, compared to the baseline features, achieved an error rate reduction of 22.2% on clean condition.
  • Keywords
    Hidden Markov models; Labeling; Mel frequency cepstral coefficient; Speech; Speech recognition; Vectors; HLDA; LDA; LPP; OLPP; feature extraction; linear transformation; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical Engineering (ICEE), 2011 19th Iranian Conference on
  • Conference_Location
    Tehran, Iran
  • Print_ISBN
    978-1-4577-0730-8
  • Electronic_ISBN
    978-964-463-428-4
  • Type

    conf

  • Filename
    5955610