مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised learning of invariant features using video

DocumentCode :

3401504

Title :

Unsupervised learning of invariant features using video

Author :

Stavens, David ; Thrun, Sebastian

Author_Institution :

Comput. Sci. Dept., Stanford Artificial Intell. Lab., Stanford, CA, USA

fYear :

2010

fDate :

13-18 June 2010

Firstpage :

1649

Lastpage :

1656

Abstract :

We present an algorithm that learns invariant features from real data in an entirely unsupervised fashion. The principal benefit of our method is that it can be applied without human intervention to a particular application or data set, learning the specific invariances necessary for excellent feature performance on that data. Our algorithm relies on the ability to track image patches over time using optical flow. With the wide availability of high frame rate video (eg: on the web, from a robot), good tracking is straightforward to achieve. The algorithm then optimizes feature parameters such that patches corresponding to the same physical location have feature descriptors that are as similar as possible while simultaneously maximizing the distinctness of descriptors for different locations. Thus, our method captures data or application specific invariances yet does not require any manual supervision. We apply our algorithm to learn domain-optimized versions of SIFT and HOG. SIFT and HOG features are excellent and widely used. However, they are general and by definition not tailored to a specific domain. Our domain-optimized versions offer a substantial performance increase for classification and correspondence tasks we consider. Furthermore, we show that the features our method learns are near the optimal that would be achieved by directly optimizing the test set performance of a classifier. Finally, we demonstrate that the learning often allows fewer features to be used for some tasks, which has the potential to dramatically improve computational concerns for very large data sets.

Keywords :

feature extraction; image sequences; unsupervised learning; video signal processing; HOG feature; SIFT feature; feature descriptors; high frame rate video; image patches; invariant features learning; optical flow; unsupervised learning; Artificial intelligence; Computer science; Image motion analysis; Laboratories; Machine learning algorithms; Optimization methods; Streaming media; Surgery; Testing; Unsupervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on

Conference_Location :

San Francisco, CA

ISSN :

1063-6919

Print_ISBN :

978-1-4244-6984-0

Type :

conf

DOI :

10.1109/CVPR.2010.5539773

Filename :

5539773

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3401504