• DocumentCode
    2505942
  • Title

    An Empirical Study of Feature Extraction Methods for Audio Classification

  • Author

    Parker, Charles

  • fYear
    2010
  • fDate
    23-26 Aug. 2010
  • Firstpage
    4593
  • Lastpage
    4596
  • Abstract
    With the growing popularity of video sharing web sites and the increasing use of consumer-level video capture devices, new algorithms are needed for intelligent searching and indexing of such data. The audio from these video streams is particularly challenging due to its low quality and high variability. Here, we perform a broad empirical study of features used for intelligent audio processing. We perform experiments on a dataset of 200 consumer videos over which we attempt to detect 10 semantic audio concepts.
  • Keywords
    Web sites; audio signal processing; feature extraction; video signal processing; video streaming; audio classification; consumer-level video capture devices; feature extraction methods; intelligent audio processing; video sharing Web sites; video streams; Conferences; Feature extraction; Mel frequency cepstral coefficient; Semantics; Signal processing algorithms; Speech; Training; Audio Classification; Audio Event; Audio Features; Consumer Video; Gammatone; MFCC; Wavelet;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2010 20th International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-7542-1
  • Type

    conf

  • DOI
    10.1109/ICPR.2010.1111
  • Filename
    5597350