DocumentCode :
3601479
Title :
Transductive Multi-View Zero-Shot Learning
Author :
Yanwei Fu ; Hospedales, Timothy M. ; Tao Xiang ; Shaogang Gong
Author_Institution :
Disney Res., Pittsburgh, PA, USA
Volume :
37
Issue :
11
fYear :
2015
Firstpage :
2332
Lastpage :
2345
Abstract :
Most existing zero-shot learning approaches exploit transfer learning via an intermediate semantic representation shared between an annotated auxiliary dataset and a target dataset with different classes and no annotation. A projection from a low-level feature space to the semantic representation space is learned from the auxiliary dataset and applied without adaptation to the target dataset. In this paper we identify two inherent limitations with these approaches. First, due to having disjoint and potentially unrelated classes, the projection functions learned from the auxiliary dataset/domain are biased when applied directly to the target dataset/ domain. We call this problem the projection domain shift problem and propose a novel framework, transductive multi-view embedding, to solve it. The second limitation is the prototype sparsity problem which refers to the fact that for each target class, only a single prototype is available for zero-shot learning given a semantic representation. To overcome this problem, a novel heterogeneous multi-view hypergraph label propagation method is formulated for zero-shot learning in the transductive embedding space. It effectively exploits the complementary information offered by different semantic representations and takes advantage of the manifold structures of multiple representation spaces in a coherent manner. We demonstrate through extensive experiments that the proposed approach (1) rectifies the projection shift between the auxiliary and target domains, (2) exploits the complementarity of multiple semantic representations, (3) significantly outperforms existing methods for both zero-shot and N-shot recognition on three image and video benchmark datasets, and (4) enables novel cross-view annotation tasks.
Keywords :
graph theory; learning (artificial intelligence); object recognition; semantic networks; N-shot recognition; auxiliary dataset; auxiliary domain; complementary information; heterogeneous multiview hypergraph label propagation method; image video benchmark dataset; manifold structure; projection domain shift problem; projection function; projection shift; prototype sparsity problem; semantic representation space; target dataset; target domain; transductive embedding space; transductive multiview embedding; transductive multiview zero-shot learning; transfer learning; zero-shot recognition; Manifolds; Pragmatics; Prototypes; Semantics; Target recognition; Vectors; Visualization; Transducitve learning; heterogeneous hypergraph; multi-view Learning; transfer Learning; zero-shot Learning;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2015.2408354
Filename :
7053935
Link To Document :
بازگشت