DocumentCode
586574
Title
Learning to predict action outcomes in continuous, relational environments
Author
Palmer, T.J. ; Bodenhamer, Matthew ; Fagg, Andrew H.
Author_Institution
Symbiotic Comput. Lab., Univ. of Oklahoma, Norman, OK, USA
fYear
2012
fDate
7-9 Nov. 2012
Firstpage
1
Lastpage
7
Abstract
We present a method for predicting action outcomes in unstructured environments with variable numbers of participants and hidden relationships between them. For example, when pouring flour from a cup into a mixing bowl, important relations must exist between the cup and the bowl. The action Pour(x, y) might depend on the precondition Above(x, y). How well the predicate Above actually predicts action success often depends on complicated world dynamics and perhaps other objects in the scene. While such predicates are commonly hand-crafted, we present in this paper a method for learning physically grounded predicates directly from the continuous data. In this manner, an agent´s own developmental experience can drive its world representations. Here, we learn such representations as ensembles (or forests) of probability trees using the Spatiotemporal Multidimensional Relational Framework (SMRF). By reasoning about individual objects, SMRF trees allow us to focus attention on action parameters while still considering other objects in a scene. We demonstrate our method on three simulated problems. Two are in a blocks world with gravity: one predicting the tipping direction of a balance scale, inspired by Siegler´s classic cognitive psychology work, and the second predicting the success of dropping one block on another. The third problem predicts the success of passing a ball in a soccer domain. In these tasks, we show an ability to scale prediction to scenes with more objects than are present in the training data.
Keywords
network theory (graphs); probability; small-world networks; trees (mathematics); Siegler classic cognitive psychology work; action Pour(x, y) might; action outcomes; complicated world dynamics; continuous; developmental experience; physically grounded predicates; precondition Above(x, y); probability trees; relational environments; spatiotemporal multidimensional relational framework; unstructured environments;
fLanguage
English
Publisher
ieee
Conference_Titel
Development and Learning and Epigenetic Robotics (ICDL), 2012 IEEE International Conference on
Conference_Location
San Diego, CA
Print_ISBN
978-1-4673-4964-2
Electronic_ISBN
978-1-4673-4963-5
Type
conf
DOI
10.1109/DevLrn.2012.6400869
Filename
6400869
Link To Document