DocumentCode
2076164
Title
Semantic Learning for Audio Applications: A Computer Vision Approach
Author
Sukthankar, Rahul ; Ke, Yan ; Hoiem, Derek
Author_Institution
Intel Research Pittsburgh, Carnegie Mellon
fYear
2006
fDate
17-22 June 2006
Firstpage
112
Lastpage
112
Abstract
Recent work in machine learning has significantly benefited semantic extraction tasks in computer vision, particularly for object recognition and image retrieval. We argue that the computer vision techniques that have been successfully applied in those settings can effectively be translated to other domains, such as audio. This claim is supported by recent results in music vs. speech classification, structure from sound, robust music identification and sound object recognition. This paper focuses on two such audio applications and demonstrates how ideas from computer vision map naturally to these problems.
Keywords
Acoustic noise; Application software; Computer vision; Image analysis; Machine learning; Music information retrieval; Object detection; Object recognition; Robustness; Speech analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition Workshop, 2006. CVPRW '06. Conference on
Print_ISBN
0-7695-2646-2
Type
conf
DOI
10.1109/CVPRW.2006.191
Filename
1640555
Link To Document