• DocumentCode
    669809
  • Title

    On investigating efficient methodology for Environmental Sound Recognition

  • Author

    Ruiz-Martinez, Cruz Alfredo ; Akhtar, Muhammad Tahir ; Washizawa, Yoshikazu ; Escamilla-Hernandez, E.

  • Author_Institution
    Univ. of Electro-Commun., Chofu, Japan
  • fYear
    2013
  • fDate
    12-15 Nov. 2013
  • Firstpage
    210
  • Lastpage
    214
  • Abstract
    This paper presents a comparative study of various methods to identify the environmental sounds. We evaluate two methods for feature extraction: Mel Frequency Cepstral Coefficients (MFCC) which is well known for speaker identification, and Matching Pursuit (MP) with Gabor Dictionary which gives a time frequency representation employed for scene recognition. In the classification stage, we show a comparison among Support Vector Machines (SVM), Logistic Regression, and Backpropagation Artificial Neural Network (BP-ANN). Simulation results show that MFCC gives a higher recognition performance as compared with MP. Furthermore, by concatenating MFCC features with some feature of MP, e.g., scale, might also improve performance in some situations. We observe that SVM show the best performance among the classifiers, for clean as well noisy signals.
  • Keywords
    Gabor filters; backpropagation; feature extraction; neural nets; regression analysis; speaker recognition; support vector machines; BP-ANN; Gabor dictionary; MFCC; MP; Matching Pursuit; Mel frequency cepstral coefficients; SVM; backpropagation artificial neural network; environmental sound recognition; feature extraction; investigating efficient methodology; logistic regression; scene recognition; speaker identification; support vector machines; time frequency representation; Dictionaries; Feature extraction; Glass; Matching pursuit algorithms; Mel frequency cepstral coefficient; Speech; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Signal Processing and Communications Systems (ISPACS), 2013 International Symposium on
  • Conference_Location
    Naha
  • Print_ISBN
    978-1-4673-6360-0
  • Type

    conf

  • DOI
    10.1109/ISPACS.2013.6704548
  • Filename
    6704548