Title :
A review in feature extraction approach in question classification using Support Vector Machine
Author :
Sangodiah, Anbuselvan ; Ahmad, Rohiza ; Ahmad, Wan Fatimah Wan
Author_Institution :
Dept. of Inf. Syst., UTAR, Kampar, Malaysia
Abstract :
Text classification which is an integral part of text mining has caught much attention in various industries and fields recently. The ability is in assigning text documents to one or more pre-defined categories based on content similarity. While most of application of text classification focuses on document level, question classification works at much granular level such as sentence and phrase. There have been numerous studies on question classification in accordance to Bloom taxonomy in assessments to measure cognitive level of learners in higher learning institutions. But it has not been effective yet to resolve overlapping issue of Bloom taxonomy verb keywords being assigned to more than one category of Bloom taxonomy. The presence of this poses a problem in respect of classifying a particular question into a right category of Bloom taxonomy. And feature extraction plays an important role in improving the accuracy of classifier such as Support Vector Machine in question classification. Much of the past related research work focus on feature extraction methods such as bag of word (BOW) and syntactic analysis to classify questions and to address the issue, an improvement in feature extraction is needed. In view of this, this study proposes an integrated approach in feature extraction involving semantic aspect in classifying questions in accordance to Bloom taxonomy. Support Vector Machine classifier is used as it is well known for its high accuracy in text classification. With all this in place, an improved accuracy in classifying questions in accordance to Bloom taxonomy can be expected.
Keywords :
cognition; computer aided instruction; data mining; feature extraction; pattern classification; support vector machines; text analysis; Bloom taxonomy; cognitive level measure; content similarity; document level; feature extraction approach; higher learning institutions; question classification; support vector machine; syntactic analysis; text classification; text mining; Conferences; Feature extraction; Semantics; Support vector machines; Taxonomy; Text categorization; Text mining; Bloom taxonomy; feature extraction; question classification; support vector machine;
Conference_Titel :
Control System, Computing and Engineering (ICCSCE), 2014 IEEE International Conference on
Print_ISBN :
978-1-4799-5685-2
DOI :
10.1109/ICCSCE.2014.7072776