• DocumentCode
    417784
  • Title
    Audio content description with wavelets and neural nets
  • Author
    Rein, Stephan ; Reisslein, Martin ; Sikora, Thomas
  • Author_Institution
    Tech. Univ. Berlin, Germany
  • Volume
    4
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    Precision audio content description is one of the key components of next-generation Internet multimedia search machines. We examine the usability of a combination of 39 different wavelets and three different types of neural nets for precision audio content description. More specifically, we develop a novel wavelet dispersion measure that measures obtained ranks of wavelet coefficients. Our dispersion measure in conjunction with a probabilistic radial basis neural network trained by only three independent example sets obtains a success rate of approximately 78% in identifying unknown complex classical music movements.
  • Keywords
    Internet; audio signal processing; identification; learning (artificial intelligence); multimedia communication; music; radial basis function networks; search engines; wavelet transforms; independent example sets; multimedia search machines; neural nets; neural network training; next generation Internet; precision audio content description; probabilistic radial basis neural network; unknown classical music identification; usability; wavelet coefficient rank; wavelet dispersion measure; wavelets; Audio recording; Content based retrieval; Dispersion; Internet; Music information retrieval; Neural networks; Performance evaluation; Usability; Wavelet coefficients; World Wide Web;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type
    conf
  • DOI
    10.1109/ICASSP.2004.1326833
  • Filename
    1326833