DocumentCode :
1763622
Title :
Learning with Hierarchical-Deep Models
Author :
Salakhutdinov, R. ; Tenenbaum, J.B. ; Torralba, Antonio
Author_Institution :
Dept. of Stat. & Comput. Sci., Univ. of Toronto, Toronto, ON, Canada
Volume :
35
Issue :
8
fYear :
2013
fDate :
Aug. 2013
Firstpage :
1958
Lastpage :
1971
Abstract :
We introduce HD (or “Hierarchical-Deep”) models, a new compositional learning architecture that integrates deep learning models with structured hierarchical Bayesian (HB) models. Specifically, we show how we can learn a hierarchical Dirichlet process (HDP) prior over the activities of the top-level features in a deep Boltzmann machine (DBM). This compound HDP-DBM model learns to learn novel concepts from very few training example by learning low-level generic features, high-level features that capture correlations among low-level features, and a category hierarchy for sharing priors over the high-level features that are typical of different kinds of concepts. We present efficient learning and inference algorithms for the HDP-DBM model and show that it is able to learn new concepts from very few examples on CIFAR-100 object recognition, handwritten character recognition, and human motion capture datasets.
Keywords :
Bayes methods; Boltzmann machines; handwritten character recognition; image motion analysis; inference mechanisms; learning (artificial intelligence); object recognition; CIFAR-100 object recognition; HD model; compositional learning architecture; compound HDP-DBM model; deep Boltzmann machine; handwritten character recognition; hierarchical Dirichlet process; hierarchical-deep model; high-level features; human motion capture datasets; low-level generic features; structured hierarchical Bayesian model; Approximation methods; Bayesian methods; Computational modeling; Machine learning; Stochastic processes; Training; Vectors; Deep networks; deep Boltzmann machines; hierarchical Bayesian models; one-shot learning; Algorithms; Artificial Intelligence; Bayes Theorem; Humans; Motion; Pattern Recognition, Automated; Visual Perception;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2012.269
Filename :
6389680
Link To Document :
بازگشت