مرکز منطقه ای اطلاع رساني علوم و فناوري - A new method of concurrently visualizing states, values, and actions in reinforcement based brain machine interfaces

DocumentCode :

3207832

Title :

A new method of concurrently visualizing states, values, and actions in reinforcement based brain machine interfaces

Author :

Jihye Bae ; Sanchez Giraldo, Luis Gonzalo ; Pohlmeyer, E.A. ; Sanchez, J.C. ; Principe, Jose C.

Author_Institution :

Dept. of Electr. & Comput. Eng., Univ. of Florida, Gainesville, FL, USA

fYear :

2013

fDate :

3-7 July 2013

Firstpage :

5402

Lastpage :

5405

Abstract :

This paper presents the first attempt to quantify the individual performance of the subject and of the computer agent on a closed loop Reinforcement Learning Brain Machine Interface (RLBMI). The distinctive feature of the RLBMI architecture is the co-adaptation of two systems (a BMI decoder in agent and a BMI user in environment). In this work, an agent implemented using Q-learning via kernel temporal difference (KTD)(λ) decodes the neural states of a monkey and transforms them into action directions of a robotic arm. We analyze how each participant influences the overall performance both in successful and missed trials by visualizing states, corresponding action value Q, and resulting actions in two-dimensional space. With the proposed methodology, we can observe how the decoder effectively learns a good state to action mapping, and how neural states affect the prediction performance.

Keywords :

biocontrol; brain-computer interfaces; learning (artificial intelligence); BMI decoder; Q-learning; closed loop RLBMI; closed loop Reinforcement Learning Brain Machine Interface; concurrent visualization; kernel temporal difference; monkey; neural states; reinforcement based brain machine interfaces; robotic arm; Decoding; Kernel; Learning (artificial intelligence); Principal component analysis; Robots; Vectors; Visualization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Engineering in Medicine and Biology Society (EMBC), 2013 35th Annual International Conference of the IEEE

Conference_Location :

Osaka

ISSN :

1557-170X

Type :

conf

DOI :

10.1109/EMBC.2013.6610770

Filename :

6610770

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3207832