• DocumentCode
    1871767
  • Title

    Anticipatory robot control for a partially observable environment using episodic memories

  • Author

    Endo, Yoichiro

  • Author_Institution
    Georgia Tech Mobile Robot Lab., Atlanta, GA
  • fYear
    2008
  • fDate
    19-23 May 2008
  • Firstpage
    2852
  • Lastpage
    2859
  • Abstract
    This paper explains an episodic-memory-based approach for computing anticipatory robot behavior in a partially observable environment. Inspired by biological findings on the mammalian hippocampus, here, episodic memories retain a sequence of experienced observation, behavior, and reward. Incorporating multiple machine learning methods, this approach attempts to help reducing the computational burden of a partially observable Markov decision process (POMDP) problem. In particular, proposed computational reduction techniques include: (1) abstraction of the state space via temporal difference learning; (2) abstraction of the action space by utilizing motor schemata; (3) narrowing down the state space in terms of goals through instance-based learning; (4) elimination of the value-iteration by assuming a unidirectional-linear-chaining formation of the state space; (5) reduction of the state-estimate computation by exploiting the property of the Poisson distribution; and (6) trimming the history length by imposing a cap on the number of episodes that are computed. Claims (5) and (6) were empirically verified, and it was confirmed that the state estimation can be in fact computed in an 0(n) time (where n is the number of the states), more efficient than a conventional Kalman-filter based approach of 0(n2).
  • Keywords
    Markov processes; Poisson distribution; robots; Poisson distribution; anticipatory robot control; computational reduction techniques; episodic memories; instance-based learning; mammalian hippocampus; multiple machine learning methods; partially observable Markov decision process; partially observable environment; temporal difference learning; unidirectional-linear-chaining formation; Biology computing; Cognitive robotics; Distributed computing; Humans; Medical services; Orbital robotics; Robot control; Robotics and automation; State estimation; State-space methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation, 2008. ICRA 2008. IEEE International Conference on
  • Conference_Location
    Pasadena, CA
  • ISSN
    1050-4729
  • Print_ISBN
    978-1-4244-1646-2
  • Electronic_ISBN
    1050-4729
  • Type

    conf

  • DOI
    10.1109/ROBOT.2008.4543642
  • Filename
    4543642