• DocumentCode
    186272
  • Title

    Multiple spatio-temporal scales neural network for contextual visual recognition of human actions

  • Author

    Minju Jung ; Jungsik Hwang ; Jun Tani

  • Author_Institution
    KAIST, Daejeon, South Korea
  • fYear
    2014
  • fDate
    13-16 Oct. 2014
  • Firstpage
    235
  • Lastpage
    241
  • Abstract
    This paper introduces a novel dynamic neural network model which can recognize dynamic visual image patterns of human actions based on learning. The proposed model is characterized by its capability of extracting the spatio-temporal feature hierarchy latent in the training visual image streams. The model achieves this property by integrating two essential ideas: (1) multiple spatial-scales processing and (2) multiple timescales processing, which have been introduced the convolutional neural network (CNN) and the multiple timescale recurrent neural network (MTRNN), respectively. The evaluation of the model performance conducted by utilizing the Weizmann dataset showed that the proposed model outperforms other neural network models in recognition of a set of prototypical human movement patterns. Furthermore, additional evaluation testing for recognition of concatenated sequences of these prototypical movement patterns indicates that the model is endowed with a remarkable capability for contextual recognition of long-range dynamic visual patterns.
  • Keywords
    convolution; feature extraction; image motion analysis; image recognition; learning (artificial intelligence); recurrent neural nets; CNN; MTRNN; contextual visual recognition; convolutional neural network; deep learning; feature extraction; human action recognition; human movement patterns; multiple timescale recurrent neural network; Accuracy; Image recognition; Kernel; Neurons; Pattern recognition; Three-dimensional displays; Visualization; Convolutional neural network; deep learning; delay response manner; dynamic vision; multiple timescale recurrent neural network; self-organization; spatio-temporal hierarchy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2014 Joint IEEE International Conferences on
  • Conference_Location
    Genoa
  • Type

    conf

  • DOI
    10.1109/DEVLRN.2014.6982987
  • Filename
    6982987