DocumentCode :
2323420
Title :
Customized term weighting scheme for document classification
Author :
Benjamin, C.M.X. ; Woon, W.L. ; Wong, K.S.D.
Author_Institution :
Malaysia Univ. of Sci. & Technol., Petaling Jaya
fYear :
2008
fDate :
13-15 May 2008
Firstpage :
294
Lastpage :
299
Abstract :
In this paper, we introduce a novel method that improves keyword matching by combining context awareness, semantic similarities, and customized weights for different categories. The algorithm weights terms using category information and semantic relationships, with WordNet as the lexical database. To demonstrate the usefulness of the approach, several tests compare it against existing methods such as the Salton, Glasgow, and Balanced Term Weighting schemes.
Keywords :
document handling; pattern classification; semantic Web; WordNet; category information; context awareness; customized term weighting scheme; document classification; keyword matching; lexical database; semantic relationships; semantic similarities; Context awareness; Databases; Frequency; Indexing; Information retrieval; Internet; Mathematics; Testing; Text analysis; Unsupervised learning;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer and Communication Engineering, 2008. ICCCE 2008. International Conference on
Conference_Location :
Kuala Lumpur
Print_ISBN :
978-1-4244-1691-2
Electronic_ISBN :
978-1-4244-1692-9
Type :
conf
DOI :
10.1109/ICCCE.2008.4580615
Filename :
4580615