• DocumentCode
    2076164
  • Title

    Semantic Learning for Audio Applications: A Computer Vision Approach

  • Author

    Sukthankar, Rahul ; Ke, Yan ; Hoiem, Derek

  • Author_Institution
    Intel Research Pittsburgh, Carnegie Mellon
  • fYear
    2006
  • fDate
    17-22 June 2006
  • Firstpage
    112
  • Lastpage
    112
  • Abstract
    Recent work in machine learning has significantly benefited semantic extraction tasks in computer vision, particularly for object recognition and image retrieval. We argue that the computer vision techniques that have been successfully applied in those settings can effectively be translated to other domains, such as audio. This claim is supported by recent results in music vs. speech classification, structure from sound, robust music identification and sound object recognition. This paper focuses on two such audio applications and demonstrates how ideas from computer vision map naturally to these problems.
  • Keywords
    Acoustic noise; Application software; Computer vision; Image analysis; Machine learning; Music information retrieval; Object detection; Object recognition; Robustness; Speech analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition Workshop, 2006. CVPRW '06. Conference on
  • Print_ISBN
    0-7695-2646-2
  • Type

    conf

  • DOI
    10.1109/CVPRW.2006.191
  • Filename
    1640555