DocumentCode :
3401504
Title :
Unsupervised learning of invariant features using video
Author :
Stavens, David ; Thrun, Sebastian
Author_Institution :
Comput. Sci. Dept., Stanford Artificial Intell. Lab., Stanford, CA, USA
fYear :
2010
fDate :
13-18 June 2010
Firstpage :
1649
Lastpage :
1656
Abstract :
We present an algorithm that learns invariant features from real data in an entirely unsupervised fashion. The principal benefit of our method is that it can be applied without human intervention to a particular application or data set, learning the specific invariances necessary for excellent feature performance on that data. Our algorithm relies on the ability to track image patches over time using optical flow. With the wide availability of high frame rate video (eg: on the web, from a robot), good tracking is straightforward to achieve. The algorithm then optimizes feature parameters such that patches corresponding to the same physical location have feature descriptors that are as similar as possible while simultaneously maximizing the distinctness of descriptors for different locations. Thus, our method captures data or application specific invariances yet does not require any manual supervision. We apply our algorithm to learn domain-optimized versions of SIFT and HOG. SIFT and HOG features are excellent and widely used. However, they are general and by definition not tailored to a specific domain. Our domain-optimized versions offer a substantial performance increase for classification and correspondence tasks we consider. Furthermore, we show that the features our method learns are near the optimal that would be achieved by directly optimizing the test set performance of a classifier. Finally, we demonstrate that the learning often allows fewer features to be used for some tasks, which has the potential to dramatically improve computational concerns for very large data sets.
Keywords :
feature extraction; image sequences; unsupervised learning; video signal processing; HOG feature; SIFT feature; feature descriptors; high frame rate video; image patches; invariant features learning; optical flow; unsupervised learning; Artificial intelligence; Computer science; Image motion analysis; Laboratories; Machine learning algorithms; Optimization methods; Streaming media; Surgery; Testing; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location :
San Francisco, CA
ISSN :
1063-6919
Print_ISBN :
978-1-4244-6984-0
Type :
conf
DOI :
10.1109/CVPR.2010.5539773
Filename :
5539773
Link To Document :
بازگشت