• DocumentCode
    1994347
  • Title

    Recovering "lack of words" in text categorization for item banks

  • Author

    Nuntiyagul, Atorn ; Cercone, Nick ; Naruedomkul, Kanlaya

  • Author_Institution
    Inst. for Innovation & Dev. of Learning Process, Mahidol Univ., Bangkok, Thailand
  • Volume
    2
  • fYear
    2005
  • fDate
    26-28 July 2005
  • Firstpage
    31
  • Abstract
    PKIP, patterned keywords in phrase, is our feature selection approach to text categorization (TC) for item banks. An item bank is a collection of textual data in which each item consists of short sentences and has only a few relevant words for categorization. Traditional TC techniques cannot provide sufficiently accurate results because of a "lack of words" problem. PKIP improves categorization accuracy and recovers from the "lack of words" problem. Our sample item bank is the collection of Thai primary mathematics problems and we use SVM as our classifier. Classification results show that PKIP produces acceptable classification performance.
  • Keywords
    pattern classification; support vector machines; text analysis; SVM classifier; Thai primary mathematics problem; feature selection; item banks; lack of words recovery; patterned keywords in phrase; short sentences; text categorization; Computer science; Educational institutions; Frequency; Mathematics; Support vector machine classification; Support vector machines; Technological innovation; Text categorization; Web pages; World Wide Web;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Software and Applications Conference, 2005. COMPSAC 2005. 29th Annual International
  • ISSN
    0730-3157
  • Print_ISBN
    0-7695-2413-3
  • Type

    conf

  • DOI
    10.1109/COMPSAC.2005.128
  • Filename
    1508076