Title :
Feature extended short text categorization based on theme ontology
Author :
Zhan, Yan ; Chen, Hao
Author_Institution :
Key Lab. of Machine Learning & Comput. Intell., Hebei Univ., Baoding, China
Abstract :
Short text classification problem as text classification a branch, in addition to the same with traditional text classification to a certain degree, still need to face some special problems to be solved, because of short text length, features sparse, measuring the words difficultly. Due to the nature of ontology emphasize related field concept, which has the defect of few characteristics in short text, it is necessary to emphasize the relationship between semantic. This paper uses the feature extended method based on theme Ontology. As considering the semantic relations, it can get better classification performance compared to the conventional method. Meanwhile, using Case-Base Maintenance learning via the GC (Generalization Capability) algorithm, which can reduce the case number into K-NN algorithm, can improve efficiency when indexing near neighbor in K-Nearest Neighbor algorithm. The numerical experiments prove the validity of this learning algorithm.
Keywords :
learning (artificial intelligence); ontologies (artificial intelligence); pattern classification; text analysis; K-nearest neighbor algorithm; case-base maintenance learning; classification performance; feature extended short text categorization; field concept; generalization capability algorithm; short text length; text classification; theme ontology; Algorithm design and analysis; Classification algorithms; Educational institutions; Ontologies; Semantics; Text categorization; Training; CBM; Short Text Categorization; Theme Ontology;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6234220