Title :
Customized term weighting scheme for document classification1
Author :
Benjamin, C.M.X. ; Woon, W.L. ; Wong, K.S.D.
Author_Institution :
Malaysia Univ. of Sci. & Technol., Petaling Jaya
Abstract :
In this paper, we introduce a novel method based on context awareness, semantic similarities and customized weights for different categories to improve keyword matching. The algorithm is able to weight terms by using category information and semantic relationships with WordNet as a lexical database. To demonstrate the usefulness of the approach, several tests are run comparing against other existing methods such as Salton, Glasgow and Balanced Term Weighting schemes.
Keywords :
document handling; pattern classification; semantic Web; WordNet; category information; context awareness; customized term weighting scheme; document classification; keyword matching; lexical database; semantic relationships; semantic similarities; Context awareness; Databases; Frequency; Indexing; Information retrieval; Internet; Mathematics; Testing; Text analysis; Unsupervised learning;
Conference_Titel :
Computer and Communication Engineering, 2008. ICCCE 2008. International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-1691-2
Electronic_ISBN :
978-1-4244-1692-9
DOI :
10.1109/ICCCE.2008.4580615