DocumentCode
3264543
Title
Applying RDF Ontologies to Improve Text Classification
Author
Xiaoyue, Wang ; Rujiang, Bai
Author_Institution
Shandong Univ. of Technol. Libr., Zibo, China
Volume
2
fYear
2009
fDate
6-7 June 2009
Firstpage
118
Lastpage
121
Abstract
Current classification methods are based on the ldquobag of wordsrdquo (BOW) representation, which only accounts for term frequency in the documents, and ignores important semantic relationships between key terms. In this paper, we proposed a system that uses ontologies and natural language processing techniques to index texts. Traditional BOW matrix is replaced by ldquoBag of Conceptsrdquo (BOC). For this purpose, we developed fully automated methods for mapping keywords to their corresponding ontology concepts. Support vector machine a successful machine learning technique is used for classification. Experimental results shows that our proposed method dose improve text classification performance significantly.
Keywords
learning (artificial intelligence); natural language processing; ontologies (artificial intelligence); support vector machines; text analysis; BOW matrix; RDF ontologies; bag of words representation; machine learning technique; natural language processing techniques; support vector machine; text classification; Electronic mail; Frequency; Indexing; Libraries; Ontologies; Resource description framework; Support vector machine classification; Support vector machines; Text categorization; Vocabulary; RDF; SVM; ontology; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Natural Computing, 2009. CINC '09. International Conference on
Conference_Location
Wuhan
Print_ISBN
978-0-7695-3645-3
Type
conf
DOI
10.1109/CINC.2009.115
Filename
5231026
Link To Document