Title :
Research on Ontology-Based Text Representation of Vector Space Model
Author :
Wei, Guiying ; Bao, Mingming ; Wu, Sen
Author_Institution :
Sch. of Econ. & Manage., Univ. of Sci. &Technol. Beijing, Beijing, China
Abstract :
In traditional Vector Space Model (VSM) the TF*IDF method is widely used to adjust the weight of terms in text mining. However TF*EDF can not represent the semantic information of text by neglecting the semantic relevance between terms. In this paper, an improved ontology-based VSM is presented, in which the ontology-based term similarity is used to readjust the weight of semantically related terms. The experimental results show that the improved VSM can perform more accurately than the traditional VSM in the calculation of the term weights.
Keywords :
data mining; ontologies (artificial intelligence); text analysis; vectors; ontology; text mining; text representation; vector space model; Biological system modeling; Classification algorithms; Computational modeling; Dynamic scheduling; Ontologies; Resource management; Semantics;
Conference_Titel :
Database Technology and Applications (DBTA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6975-8
Electronic_ISBN :
978-1-4244-6977-2
DOI :
10.1109/DBTA.2010.5658938