مرکز منطقه ای اطلاع رساني علوم و فناوري - Evaluating effectiveness and portability of reinforcement learned dialogue strategies with real users: the talk TownInfo evaluation

DocumentCode :

1858451

Title :

Evaluating effectiveness and portability of reinforcement learned dialogue strategies with real users: the talk TownInfo evaluation

Author :

Lemon, O. ; Georgila, K. ; Henderson, J.

Author_Institution :

Sch. of Inf., Edinburgh Univ., Edinburgh

fYear :

2006

fDate :

10-13 Dec. 2006

Firstpage :

178

Lastpage :

181

Abstract :

We report evaluation results for real users of a learnt dialogue management policy versus a hand-coded policy in the TALK project\´s "Townlnfo" tourist information system. The learnt policy, for filling and confirming information slots, was derived from COMMUNICATOR (flight-booking) data using reinforcement learning (RL) as described in [2], ported to the tourist information domain (using a general method that we propose here), and tested using 18 human users in 180 dialogues, who also used a state-of-the-art hand- coded dialogue policy embedded in an otherwise identical system. We found that users of the (ported) learned policy had an average gain in perceived task completion of 14.2% (from 67.6% to 81.8% at p < .03), that the hand-coded policy dialogues had on average 3.3 more system turns (p < .01), and that the user satisfaction results were comparable, even though the policy was learned for a different domain. Combining these in a dialogue reward score, we found a 14.4% increase for the learnt policy (a 23.8% relative increase, p < .03). These results are important because they show a) that results for real users are consistent with results for automatic evaluation [2] of learned policies using simulated users [3, 4], b) that a policy learned using linear function approximation over a very large policy space [2] is effective for real users, and c) that policies learned using data for one domain can be used successfully in other domains. We also present a qualitative discussion of the learnt policy.

Keywords :

function approximation; information systems; interactive systems; learning (artificial intelligence); natural language interfaces; travel industry; Talk Towninfo evaluation; Townlnfo tourist information system; hand-coded policy; learnt dialogue management policy; linear function approximation; reinforcement learned dialogue strategies; Filling; Function approximation; Graphical user interfaces; Humans; Informatics; Learning; Management information systems; Natural languages; Project management; System testing;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Spoken Language Technology Workshop, 2006. IEEE

Conference_Location :

Palm Beach

Print_ISBN :

1-4244-0872-5

Type :

conf

DOI :

10.1109/SLT.2006.326774

Filename :

4123391

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1858451