DocumentCode :
2888602
Title :
Text Classification Based on a Combination of Ontology with Statistical Method
Author :
Yu, Feng ; Zheng, De-quan ; Zhao, Tie-jun ; Li, Sheng ; Yu, Hao
Author_Institution :
Sch. of Comput. & Inf. Eng., Harbin Univ. of Commerce
fYear :
2006
fDate :
13-16 Aug. 2006
Firstpage :
1042
Lastpage :
1047
Abstract :
Text classification is becoming one of the key techniques in organizing and handling a large amount of text data. In this paper, a combination of ontology with statistical method is presented to improve the precision of text classification. In this study, first, different kind of linguistic ontology knowledge will be respectively acquired by learning training corpus to determine text classifiers. For an actual document, the semantic evaluation value of the document will respectively be gotten by different kind of linguistic ontology knowledge and the categories will be judged by the highest evaluation value. Compared with Bayes, k-nearest neighbor and support vector machine, the proposed approach outperforms previous works
Keywords :
classification; computational linguistics; knowledge acquisition; learning (artificial intelligence); ontologies (artificial intelligence); statistical analysis; text analysis; Bayes method; document semantic evaluation value; k-nearest neighbor method; linguistic ontology knowledge acquisition; statistical method; support vector machine method; text classification; text data handling; training corpus learning; Cybernetics; Data engineering; Machine learning; Natural language processing; Natural languages; Ontologies; Organizing; Statistical analysis; Support vector machine classification; Support vector machines; Text categorization; Training data; Text classification; linguistic ontology knowledge; ontology; statistical method;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2006 International Conference on
Conference_Location :
Dalian, China
Print_ISBN :
1-4244-0061-9
Type :
conf
DOI :
10.1109/ICMLC.2006.258557
Filename :
4028217
Link To Document :
بازگشت