DocumentCode :
1502985
Title :
Online learning in autonomic multi-hop wireless networks for transmitting mission-critical applications
Author :
Shiang, Hsien-Po ; Van der Schaar, Mihaela
Author_Institution :
Dept. of Electr. Eng., Univ. of California, Los Angeles, CA, USA
Volume :
28
Issue :
5
fYear :
2010
fDate :
6/1/2010 12:00:00 AM
Firstpage :
728
Lastpage :
741
Abstract :
In this paper, we study how to optimize the transmission decisions of nodes aimed at supporting mission-critical applications, such as surveillance, security monitoring, and military operations, etc. We focus on a network scenario where multiple source nodes transmit simultaneously mission-critical data through relay nodes to one or multiple destinations in multi-hop wireless Mission-Critical Networks (MCN). In such a network, the wireless nodes can be modeled as agents that can acquire local information from their neighbors and, based on this available information, can make timely transmission decisions to minimize the end-to-end delays of the mission-critical applications. Importantly, the MCN needs to cope in practice with the time-varying network dynamics. Hence, the agents need to make transmission decisions by considering not only the current network status, but also how the network status evolves over time, and how this is influenced by the actions taken by the nodes. We formulate the agents´ autonomic decision making problem as a Markov decision process (MDP) and construct a distributed MDP framework, which takes into consideration the informationally-decentralized nature of the multi-hop MCN. We further propose an online model-based reinforcement learning approach for agents to solve the distributed MDP at runtime, by modeling the network dynamics using priority queuing. We compare the proposed model-based reinforcement learning approach with other model-free reinforcement learning approaches in the MCN. The results show that the proposed model-based reinforcement learning approach for mission-critical applications not only outperforms myopic approaches without learning capability, but also outperforms conventional model-free reinforcement learning approaches.
Keywords :
Markov processes; learning (artificial intelligence); queueing theory; time-varying channels; Markov decision process; autonomic multi-hop wireless networks; mission-critical applications; online learning; priority queuing; reinforcement learning approach; time-varying network dynamics; Communication system security; Data security; Information security; Learning; Mission critical systems; Monitoring; Relays; Spread spectrum communication; Surveillance; Wireless networks; Autonomic multi-hop wireless networks; distributed Markov decision process; multi-user mission-critical transmission; online reinforcement learning;
fLanguage :
English
Journal_Title :
Selected Areas in Communications, IEEE Journal on
Publisher :
ieee
ISSN :
0733-8716
Type :
jour
DOI :
10.1109/JSAC.2010.100610
Filename :
5472428
Link To Document :
بازگشت