• DocumentCode
    3144295
  • Title

    Automatic classification of audio data using nonlinear neural response models

  • Author

    Bach, Jörg-Hendrik ; Meyer, Arne-Freerk ; McElfresh, Duncan ; Anemüller, Jörn

  • Author_Institution
    Carl-von-Ossietzky Univ. Oldenburg, Oldenburg, Germany
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    357
  • Lastpage
    360
  • Abstract
    Physiologically inspired feature extraction for audio classification often uses simplified parametric models of auditory processing. We employ linear and nonlinear neuron models directly derived from neural responses in zebra finches as feature extraction front-ends. The most important features were identified using automatic feature selection techniques. This allows both a quantitative evaluation of neural features for sound classification tasks in terms of classification accuracy and a qualitative analysis of the auditory features that are most relevant. It turned out that a relatively small subpopulation of neural responses is sufficient to achieve reasonable classification performance. For linear as well as for nonlinear neuron models, we found three different shapes of spectro-temporal features to be archetypical. The relation of these to analytic approaches (such as Gabor filters) is discussed. The overall classification rates in a 6-class task reached up to 94% accuracy. Nonlinear models provided up to 15% benefit over linear models, indicating the importance of nonlinearities in classification with physiologically motivated features.
  • Keywords
    audio signal processing; feature extraction; physiology; signal classification; audio classification; audio data; auditory processing; automatic classification; feature extraction; neural responses; nonlinear neural response models; nonlinear neuron models; physiologically motivated features; zebra finches; Accuracy; Data models; Feature extraction; Modulation; Neurons; Niobium; Shape; audio classification; biological systems; physiologically motivated feature extraction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6287890
  • Filename
    6287890