DocumentCode
3244751
Title
Issues in the evaluation of spoken dialogue systems using objective and subjective measures
Author
Larsen, Lars Bo
Author_Institution
Dept. of Commun. Technol., Aalborg Univ., Denmark
fYear
2003
fDate
30 Nov.-3 Dec. 2003
Firstpage
209
Lastpage
214
Abstract
The paper presents results and conclusions about the current evaluation methodologies for spoken dialogue systems (SDS). The PARADISE paradigm, used for evaluation in the DARPA Communicator project, is briefly introduced and discussed through the application to the OVID home banking dialogue system. It is shown to provide results consistent with those obtained by the DARPA community, but a number of problems and limitations are pointed out. The issue of user attitude measures obtained through questionnaires is discussed. This is an area that has not received much attention from the speech technology community, but is important in order to obtain valid results and conclusions about usability. A general presentation of the issues that must be addressed when developing and employing questionnaires is given with a focus on how to ensure the reliability and validity of the results. Examples of results obtained from the OVID project are used to illustrate this.
Keywords
human computer interaction; human factors; interactive systems; natural language interfaces; speech recognition; speech-based user interfaces; DARPA Communicator project; PARADISE paradigm; evaluation methodologies; home banking dialogue system; objective measures; questionnaires; spoken dialogue systems; subjective measures; Banking; Communications technology; Multimedia communication; Multimedia systems; Privacy; Sliding mode control; Speech analysis; Speech recognition; System performance; Usability
fLanguage
English
Publisher
IEEE
Conference_Titel
Automatic Speech Recognition and Understanding, 2003. ASRU '03. 2003 IEEE Workshop on
Print_ISBN
0-7803-7980-2
Type
conf
DOI
10.1109/ASRU.2003.1318442
Filename
1318442