• DocumentCode
    3606186
  • Title

    DeepBag: Recognizing Handbag Models

  • Author

    Yan Wang ; Sheng Li ; Kot, Alex C.

  • Author_Institution
    Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    17
  • Issue
    11
  • fYear
    2015
  • Firstpage
    2072
  • Lastpage
    2083
  • Abstract
    In this paper, we address the problem of branded handbag recognition. It is a challenging problem due to the non-rigid deformation, illumination changes, and inter-class similarity. We propose a novel framework based on deep convolutional neural network (CNN). Concretely, we propose a new CNN model, called feature selective joint classification - regression CNN (FSCR-CNN). Its advantages lie in two folds: 1) it alleviates the illumination changes by a feature selection strategy to focus on the color- nondiscriminative features in the network learning, and 2) rather than only targeting on the hard label (i.e., the handbag model), it also incorporates a soft label (i.e., a distribution measuring the similarity between the ground truth model and all the models to be trained) to construct the loss function for training CNN, which leads to a better classifier for handbags with large inter-class similarity. We evaluate the performance of our framework on a newly built branded handbag dataset. The results show that it performs favorably for recognizing handbags with 94.48% in accuracy. We also apply the proposed FSCR-CNN model in recognizing other fine-grained objects with state-of-the-art CNN architectures, which is able to achieve over 5% improvement in accuracy.
  • Keywords
    feature selection; feedforward neural nets; image classification; image colour analysis; learning (artificial intelligence); lighting; object recognition; regression analysis; DeepBag; FSCR-CNN model; branded handbag recognition problem; deep convolutional neural network; feature selection strategy; feature selective joint classification-regression CNN model; ground truth model; illumination changes; interclass similarity; loss function; network learning; nonrigid deformation; similarity distribution measurement; Computer architecture; Image color analysis; Image recognition; Lighting; Object recognition; Power capacitors; Training; Convolutional neural networks; feature selection; handbag recognition; soft label;
  • fLanguage
    English
  • Journal_Title
    Multimedia, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1520-9210
  • Type

    jour

  • DOI
    10.1109/TMM.2015.2480228
  • Filename
    7272090