• DocumentCode
    3776196
  • Title

    Dictionary learning based sparse coefficients for speech recognition in noisy environment

  • Author

    R S Ramitha;M Baburaj;Sudhish N George

  • Author_Institution
    Electronics and Communication Engineering, National Institute of Technology, Calicut, Kearla, India
  • fYear
    2015
  • Firstpage
    151
  • Lastpage
    156
  • Abstract
    Automatic recognition of speech is an active area of research which provides a smooth platform for human-machine interaction. Complexity and recognition accuracy mainly depend on the selection of signal features and classifier. Commonly used features in the field of speech recognition are mel-frequency cepstral coefficients (MFCCs), line spectral frequencies (LSF), short time energy (STE) and linear prediction coefficients (LPC). In this paper, instead of using these well-established features, sparse feature derived from the dictionary of signal atoms using sparse coding is used for feature extraction. To improve the performance, artificial neural network (ANN) is used for the classification of isolated speech. Experimental results show that the proposed method works better in noisy environment upto 20dB SNR without using any speech enhancement method. To remove heavy background noise, a sparsity based speech enhancement algorithm is also proposed in the preprocessing stage of speech recognition. The proposed algorithm is compared with other popular speech recognition methods and it is observed that the proposed method can achieve better performance than the others.
  • Keywords
    "Dictionaries","Speech","Speech recognition","Feature extraction","Speech enhancement","Training","Matching pursuit algorithms"
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Computational Systems (RAICS), 2015 IEEE Recent Advances in
  • Type

    conf

  • DOI
    10.1109/RAICS.2015.7488405
  • Filename
    7488405