• DocumentCode
    3078169
  • Title

    Automatic sound annotation

  • Author

    Cano, Pedro ; Koppenberger, Markus

  • Author_Institution
    Music Technol. Group, Universitat Pompeu Fabra, Barcelona
  • fYear
    2004
  • fDate
    Sept. 29 2004-Oct. 1 2004
  • Firstpage
    391
  • Lastpage
    400
  • Abstract
    Sound engineers need to access vast collections of sound effects for their film and video productions. Sound effects providers rely on text-retrieval techniques to offer their collections. Currently, annotation of audio content is done manually, which is an arduous task. Automatic annotation methods, normally fine-tuned to reduced domains such as musical instruments or reduced sound effects taxonomies, are not mature enough for labeling with great detail any possible sound. A general sound recognition tool would require: first, a taxonomy that represents the world and, second, thousands of classifiers, each specialized in distinguishing little details. We report experimental results on a general sound annotator. To tackle the taxonomy definition problem we use WordNet, a semantic network that organizes real world knowledge. In order to overcome the need of a huge number of classifiers to distinguish many different sound classes, we use a nearest-neighbor classifier with a database of isolated sounds unambiguously linked to WordNet concepts. A 30% concept prediction is achieved on a database of over 50,000 sounds and over 1600 concepts
  • Keywords
    acoustic signal processing; information retrieval; pattern classification; semantic networks; WordNet; audio content; automatic sound annotation; general sound recognition tool; musical instruments; nearest-neighbor classifier; real world knowledge; reduced sound effects taxonomies; semantic network; sound effects; taxonomy definition problem; text-retrieval techniques; Acoustical engineering; Electronic mail; Image databases; Instruments; Labeling; Libraries; Motion pictures; Production; Tail; Taxonomy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2004. Proceedings of the 2004 14th IEEE Signal Processing Society Workshop
  • Conference_Location
    Sao Luis
  • ISSN
    1551-2541
  • Print_ISBN
    0-7803-8608-4
  • Type

    conf

  • DOI
    10.1109/MLSP.2004.1422998
  • Filename
    1422998