مرکز منطقه ای اطلاع رساني علوم و فناوري - On Stochastic Feedback Control for Multi-Antenna Beamforming: Formulation and Low-Complexity Algorithms

DocumentCode :

83560

Title :

On Stochastic Feedback Control for Multi-Antenna Beamforming: Formulation and Low-Complexity Algorithms

Author :

Sun Sun ; Min Dong ; Ben Liang

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Toronto, Toronto, ON, Canada

Volume :

Issue :

fYear :

2014

fDate :

Sept. 2014

Firstpage :

4731

Lastpage :

4745

Abstract :

Based on the Gauss-Markov channel model, we investigate the stochastic feedback control for transmit beamforming in multiple-input-single-output systems and design practical implementation algorithms leveraging techniques in dynamic programming and reinforcement learning. We first validate the Markov decision process formulation of the underlying feedback control problem with a 4R-variable (4R-V) state, where R is the number of the transmit antennas. Due to the high complexity of finding an optimal feedback policy under the 4R-V state, we consider a reduced 2-V state. As opposed to a previous study that assumes the feedback problem under such a 2-V state remaining an MDP formulation, our analysis indicates that the underlying problem is no longer an MDP. Nonetheless, the approximation as an MDP is shown to be justifiable and efficient. Based on the quantized 2-V state and the MDP approximation, we propose practical implementation algorithms for feedback control with unknown state transition probabilities. In particular, we provide model-based offline and online learning algorithms, as well as a model-free learning algorithm. We investigate and compare these algorithms through extensive simulations and provide their efficiency analysis. According to these results, the application rule of these algorithms is established under both statistically stable and unstable channels.

Keywords :

MIMO communication; Markov processes; antenna arrays; approximation theory; array signal processing; feedback; learning (artificial intelligence); optimal control; stochastic processes; telecommunication computing; telecommunication control; transmitting antennas; 4R-V state; Gauss-Markov channel model; MDP approximation; MDP formulation; Markov decision process; dynamic programming; model-based offline learning algorithm; model-free learning algorithm; multiantenna beamforming; multiple-input-single-output system; online learning algorithm; optimal feedback policy; reinforcement learning; stochastic feedback control; transmit antennas; Algorithm design and analysis; Array signal processing; Correlation; Feedback control; Receivers; Transmitters; Vectors; Beamforming; implementation algorithms; reinforcement learning; stochastic feedback control;

fLanguage :

English

Journal_Title :

Wireless Communications, IEEE Transactions on

Publisher :

ieee

ISSN :

1536-1276

Type :

jour

DOI :

10.1109/TWC.2014.2336661

Filename :

6849980

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=83560