• DocumentCode
    2184658
  • Title

    Toward understanding natural language directions

  • Author

    Kollar, Thomas ; Tellex, Stefanie ; Roy, Deb ; Roy, Nicholas

  • Author_Institution
    Stata Center, MIT CSAIL, Cambridge, MA, USA
  • fYear
    2010
  • fDate
    2-5 March 2010
  • Firstpage
    259
  • Lastpage
    266
  • Abstract
    Speaking using unconstrained natural language is an intuitive and flexible way for humans to interact with robots. Understanding this kind of linguistic input is challenging because diverse words and phrases must be mapped into structures that the robot can understand, and elements in those structures must be grounded in an uncertain environment. We present a system that follows natural language directions by extracting a sequence of spatial description clauses from the linguistic input and then infers the most probable path through the environment given only information about the environmental geometry and detected visible objects. We use a probabilistic graphical model that factors into three key components. The first component grounds landmark phrases such as ¿the computers¿ in the perceptual frame of the robot by exploiting co-occurrence statistics from a database of tagged images such as Flickr. Second, a spatial reasoning component judges how well spatial relations such as ¿past the computers¿ describe a path. Finally, verb phrases such as ¿turn right¿ are modeled according to the amount of change in orientation in the path. Our system follows 60% of the directions in our corpus to within 15 meters of the true destination, significantly outperforming other approaches.
  • Keywords
    computational linguistics; human-robot interaction; natural languages; probability; Flickr; cooccurrence statistics; human-robot interaction; linguistics; natural language; probabilistic graphical model; Computational geometry; Data mining; Graphical models; Human robot interaction; Image databases; Information geometry; Natural languages; Object detection; Object oriented databases; Statistics; direction understanding; route instructions; spatial language;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Human-Robot Interaction (HRI), 2010 5th ACM/IEEE International Conference on
  • Conference_Location
    Osaka
  • Print_ISBN
    978-1-4244-4892-0
  • Electronic_ISBN
    978-1-4244-4893-7
  • Type

    conf

  • DOI
    10.1109/HRI.2010.5453186
  • Filename
    5453186