DocumentCode
2397058
Title
Case based Indonesian closed domain question answering system with real world questions
Author
Fikri, Abdurrisyad ; Purwarianti, Ayu
Author_Institution
Sch. of Electr. Eng. & Inf., Inst. Teknol. Bandung (ITB), Bandung, Indonesia
fYear
2012
fDate
30-31 Oct. 2012
Firstpage
181
Lastpage
186
Abstract
Number of people having expertise in a certain domain is less than people who need information in that domain. In this situation, an automatic question answering (QA) system is necessary. Observing available manual QA sites on internet, the real world question that people usually ask have different expected answer type (EAT) compared to a common automatic QA. Addressing a case study of a religion domain which makes it a closed domain QA, we proposed the EAT into 6 types: LAW, DEFINITION, COMPARISON, METHOD, TIME and PERSON. Different with common QA approach, we built the QA system using case based approach which consists of two main components: Question Analyzer and Case Retriever. Related with the case based reasoning (CBR) framework, these two main components act as the Retrieve and Reuse process while the Revise and Retain process is handle by Case Retainer component. The QA system was built using available Indonesian Natural Language Processing (NLP) tools and FreeCBR as the CBR library. The experiments were done to calculate the accuracy and testing the system with unknown case. By using 77 cases collected from internet with assumption that all answers are available, the experiments achieved 97% accuracy. And by using 10 test cases for the unknown case, the similarity score calculated by the system showed that the test questions have no answer in the available case base.
Keywords
case-based reasoning; natural language processing; question answering (information retrieval); CBR framework; CBR library; EAT; FreeCBR; Indonesian NLP tool; Indonesian natural language processing tool; Internet; automatic QA system; automatic question answering system; case retainer component; case retriever; case-based Indonesian closed domain question answering system; case-based reasoning framework; comparison type; definition type; expected answer type; law type; method type; person type; question analyzer; real world question; religion domain; retrieve-reuse process; revise-retain process; time type; Accuracy; Cognition; Equations; Internet; Semantics; Telecommunications; Case Based Reasoning; Closed Domain; Question Answering System; Real World Questions;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunication Systems, Services, and Applications (TSSA), 2012 7th International Conference on
Conference_Location
Bali
Print_ISBN
978-1-4673-4549-1
Type
conf
DOI
10.1109/TSSA.2012.6366047
Filename
6366047
Link To Document