• DocumentCode
    3019501
  • Title
    Driving me around the bend: Learning to drive from visual gist
  • Author
    Pugeault, Nicolas; Bowden, Richard
  • Author_Institution
    Centre for Vision, Speech & Signal Process., Univ. of Surrey, Guildford, UK
  • fYear
    2011
  • fDate
    6-13 Nov. 2011
  • Firstpage
    1022
  • Lastpage
    1029
  • Abstract
    This article proposes an approach to learning steering and road following behaviour from a human driver using holistic visual features. We use a random forest (RF) to regress a mapping between these features and the driver's actions, and propose an alternative to classical random forest regression based on the Medoid (RF-Medoid), that reduces the underestimation of extreme control values. We compare prediction performance using different holistic visual descriptors: GIST, Channel-GIST (C-GIST) and Pyramidal-HOG (P-HOG). The proposed methods are evaluated on two different datasets: predicting human behaviour on countryside roads and also for autonomous control of a robot on an indoor track. We show that 1) C-GIST leads to the best predictions on both sequences, and 2) RF-Medoid leads to a better estimation of extreme values, where a classical RF tends to under-steer. We use around 10% of the data for training and show excellent generalization over a dataset of thousands of images. Importantly, we do not engineer the solution but instead use machine learning to automatically identify the relationship between visual features and behaviour, providing an efficient, generic solution to autonomous control.
  • Keywords
    feature extraction; regression analysis; road traffic; traffic engineering computing; Channel-GIST visual descriptor; Medoid; Pyramidal-HOG visual descriptor; histogram-of-gradients; human behaviour prediction; human driver; random forest regression; road following behaviour; robot control; steering learning behavior; visual feature; visual gist; Regression tree analysis; Roads; Training; Vectors; Vegetation; Vehicles; Visualization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Vision Workshops (ICCV Workshops), 2011 IEEE International Conference on
  • Conference_Location
    Barcelona
  • Print_ISBN
    978-1-4673-0062-9
  • Type
    conf
  • DOI
    10.1109/ICCVW.2011.6130363
  • Filename
    6130363
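
The abstract contrasts classical random forest regression, which tends to under-steer, with a Medoid-based alternative, but it does not spell out the aggregation step. The sketch below only illustrates the general idea under stated assumptions: a set of candidate steering values (the hypothetical `votes` list, standing in for whatever outputs the forest aggregates) is summarised once by its mean and once by its medoid, the candidate with the smallest total distance to all others. The function name and the data are illustrative and are not taken from the paper.

```python
from statistics import mean


def medoid(values):
    """Return the element of `values` with the smallest total
    absolute distance to all other elements."""
    return min(values, key=lambda v: sum(abs(v - w) for w in values))


# Hypothetical steering candidates in arbitrary units (assumed data):
# most candidates indicate a sharp turn, a few indicate almost none.
votes = [0.0, 0.1, 0.9, 1.0, 1.0]

mean_estimate = mean(votes)      # averaging pulls the estimate towards the centre
medoid_estimate = medoid(votes)  # the medoid is always one of the candidates

print("mean:  ", mean_estimate)
print("medoid:", medoid_estimate)
```

On this toy input the mean is diluted by the two low candidates, while the medoid stays at a value that was actually observed, which is one way to read the abstract's claim about extreme control values being underestimated less.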