DocumentCode
1994347
Title
Recovering "lack of words" in text categorization for item banks
Author
Nuntiyagul, Atorn ; Cercone, Nick ; Naruedomkul, Kanlaya
Author_Institution
Inst. for Innovation & Dev. of Learning Process, Mahidol Univ., Bangkok, Thailand
Volume
2
fYear
2005
fDate
26-28 July 2005
Firstpage
31
Abstract
PKIP, patterned keywords in phrase, is our feature selection approach to text categorization (TC) for item banks. An item bank is a collection of textual data in which each item consists of short sentences and has only a few relevant words for categorization. Traditional TC techniques cannot provide sufficiently accurate results because of a "lack of words" problem. PKIP improves categorization accuracy and recovers from the "lack of words" problem. Our sample item bank is the collection of Thai primary mathematics problems and we use SVM as our classifier. Classification results show that PKIP produces acceptable classification performance.
Keywords
pattern classification; support vector machines; text analysis; SVM classifier; Thai primary mathematics problem; feature selection; item banks; lack of words recovery; patterned keywords in phrase; short sentences; text categorization; Computer science; Educational institutions; Frequency; Mathematics; Support vector machine classification; Support vector machines; Technological innovation; Text categorization; Web pages; World Wide Web;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Software and Applications Conference, 2005. COMPSAC 2005. 29th Annual International
ISSN
0730-3157
Print_ISBN
0-7695-2413-3
Type
conf
DOI
10.1109/COMPSAC.2005.128
Filename
1508076
Link To Document