• DocumentCode
    1660935
  • Title

    Approximating Model Equivalence in Interactive Dynamic Influence Diagrams Using Top K Policy Paths

  • Author

    Zeng, Yifeng ; Chen, Yingke ; Doshi, Prashant

  • Author_Institution
    Dept. of Comput. Sci., Aalborg Univ., Aalborg, Denmark
  • Volume
    2
  • fYear
    2011
  • Firstpage
    208
  • Lastpage
    211
  • Abstract
    Interactive dynamic influence diagrams (I-DIDs) are graphical models for sequential decision making in uncertain settings shared by other agents. Algorithms for solving I-DIDs face the challenge of an exponentially growing space of behavioral models ascribed to other agents over time. Previous approaches mainly cluster behaviorally equivalent models to reduce the complexity of I-DID solutions. In this paper, we seek to further reduce the model space by introducing an approximate measure of behavioral equivalence (BE) and using it to group models. Specifically, we focus on $K$ most probable paths in the solution of each model and compare these policy paths to determine approximate BE. We discuss the challenges in computing the top $K$ policy paths and experimentally evaluate the performance of this heuristic approach in terms of the scalability and quality of the solution.
  • Keywords
    approximation theory; decision making; graph theory; multi-agent systems; probability; I-DID; K most probable paths; agent modeling; behavioral equivalence; behavioral model; graphical model; group models; interactive dynamic influence diagram; model equivalence approximation; model space reduction; sequential decision making; top K policy paths; Approximation methods; Computational modeling; Computer science; Educational institutions; Games; Graphical models; Multiagent systems;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology (WI-IAT), 2011 IEEE/WIC/ACM International Conference on
  • Conference_Location
    Lyon
  • Print_ISBN
    978-1-4577-1373-6
  • Electronic_ISBN
    978-0-7695-4513-4
  • Type

    conf

  • DOI
    10.1109/WI-IAT.2011.79
  • Filename
    6040778