• DocumentCode
    1873663
  • Title

    An approximate algorithm for solving oracular POMDPs

  • Author

    Armstrong-Crews, Nicholas ; Veloso, Manuela

  • Author_Institution
    Robot. Inst., Carnegie Mellon Univ., Pittsburgh, PA
  • fYear
    2008
  • fDate
    19-23 May 2008
  • Firstpage
    3346
  • Lastpage
    3352
  • Abstract
    We propose a new approximate algorithm, LA- JIV (lookahead J-MDP information value), to solve oracular partially observable Markov decision problems (OPOMDPs), a special type of POMDP that rather than standard observations includes an "oracle" that can be consulted for full state information at a fixed cost. We previously introduced JIV (J-MDP information value) to solve OPOMDPs, an heuristic algorithm that utilizes the solution of the underlying MDP and weighs the value of consulting the oracle against the value of taking a state-modifying action. While efficient, JIV will rarely find the optimal solution. In this paper, we extend JIV to include lookahead, thereby permitting arbitrarily small deviation from the optimal policy\´s long-term expected reward at the cost of added computation time. The depth of the lookahead is a parameter that governs this tradeoff; by iteratively increasing this depth, we provide an anytime algorithm that yields an ever- improving solution. LA-JIV leverages the OPOMDP framework\´s unique characteristics to outperform general-purpose approximate POMDP solvers; in fact, we prove that LA-JIV is a poly-time approximation scheme (PTAS) with respect to the size of the state and observation spaces, thereby showing rigorously that OPOMDPs are "easier" than POMDPs. Finally, we substantiate our theoretical results via an empirical analysis of a benchmark OPOMDP instance.
  • Keywords
    Markov processes; approximation theory; decision making; intelligent robots; heuristic algorithm; oracular partially observable Markov decision problems; polytime approximation scheme; robots; time-varying stochastic process; Computer science; Costs; Heuristic algorithms; Medical robotics; Observability; Orbital robotics; Robot sensing systems; Robotics and automation; USA Councils; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robotics and Automation, 2008. ICRA 2008. IEEE International Conference on
  • Conference_Location
    Pasadena, CA
  • ISSN
    1050-4729
  • Print_ISBN
    978-1-4244-1646-2
  • Electronic_ISBN
    1050-4729
  • Type

    conf

  • DOI
    10.1109/ROBOT.2008.4543721
  • Filename
    4543721