A simulation-based method for aggregating Markov chains

Author

Deng, Kun ; Mehta, Prashant G. ; Meyn, Sean P.

Author_Institution

Coordinated Sci. Lab., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA

fYear

2009

fDate

15-18 Dec. 2009

Firstpage

4710

Lastpage

4716

Abstract

This paper addresses model reduction for a Markov chain on a large state space. A simulation-based framework is introduced to perform state aggregation of the Markov chain based on observations of a single sample path. The Kullback-Leibler (K-L) divergence rate is employed as a metric to measure the distance between two stationary Markov chains. Model reduction with respect to this metric is cast as an infinite-horizon average cost optimal control problem. In this way an optimal policy corresponds to an optimal partition of the state space with respect to the K-L divergence rate. The optimal control problem is simplified in an approximate dynamic programming (ADP) framework: First, a relaxation of the policy space is performed, and based on this a parameterization of the set of optimal policies is introduced. This makes possible a stochastic approximation approach to compute the best policy within a given parameterized class. The algorithm can be implemented using a single sample path of the Markov chain. Convergence is established using the ODE method. Examples illustrate the theoretical results, and show remarkably low variance and fast convergence.

Keywords

Markov processes; cost optimal control; dynamic programming; infinite horizon; reduced order systems; state-space methods; Kullback-Leibler divergence rate; ODE method; aggregating Markov chains; approximate dynamic programming; infinite-horizon average cost optimal control; model reduction; optimal partition; optimal policy; simulation-based framework; simulation-based method; state aggregation; state space; stochastic approximation approach; Application software; Biological system modeling; Computational modeling; Convergence; Cost function; Dynamic programming; Optimal control; Reduced order systems; State-space methods; Stochastic processes;

fLanguage

English

Publisher

ieee

Conference_Titel

Decision and Control, 2009 held jointly with the 2009 28th Chinese Control Conference. CDC/CCC 2009. Proceedings of the 48th IEEE Conference on

Conference_Location

Shanghai

ISSN

0191-2216

Print_ISBN

978-1-4244-3871-6

Electronic_ISBN

0191-2216

Type

conf

DOI

10.1109/CDC.2009.5400533

Filename

5400533