• DocumentCode
    2285450
  • Title

    Using the Fisher kernel method for Web audio classification

  • Author

    Moreno, Pedro J. ; Rifkin, Ryan

  • Author_Institution
    Cambridge Res. Lab., Compaq Comput. Corp., Cambridge, MA, USA
  • Volume
    6
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    2417
  • Abstract
    As the multimedia content of the Web increases techniques to automatically classify this content become more important. We present a system to classify audio files collected from the Web. The system classifies any audio file as belonging to one of three categories: speech, music and other. To classify the audio files, we use the technique of Fisher kernels. The technique as proposed by Jaakkola (1998) assumes a probabilistic generative model for the data, in our case a Gaussian mixture model. Then a discriminative classifier uses the GMM as an intermediate step to produce appropriate feature vectors. Support vector machines are our choice of discriminative classifier. We present classification results on a collection of more than 173 hours of Web audio randomly collected. We believe our results represent one of the first realistic studies of audio classification performance on found data. Our final system yielded a classification rate of 81.8%
  • Keywords
    Internet; classification; information resources; learning automata; multimedia systems; probability; Fisher kernel method; Gaussian mixture model; Web audio classification; Web multimedia content; content classification; discriminative classifier; feature vectors; music; probabilistic generative model; speech; support vector machines; Art; Audio recording; Covariance matrix; Hidden Markov models; Integrated circuit modeling; Kernel; Labeling; Laboratories; Speech; Tin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.859329
  • Filename
    859329