DocumentCode :
1689773
Title :
An enhanced data mining model for text classification
Author :
Nithya, K. ; Kalaivaani, P.C.D. ; Thangarajan, R.
Author_Institution :
Dept. of CSE, Kongu Eng. Coll., Erode, India
fYear :
2012
Firstpage :
1
Lastpage :
4
Abstract :
Classification plays a vital role in many information management and retrieval tasks. This paper studies classification of text document. Text classification is a supervised technique that uses labeled training data to learn the classification system and then automatically classifies the remaining text using the learned system. In this paper, we propose a mining model consists of sentence-based concept analysis, document-based concept analysis, and corpus-based concept-analysis. Then we analyze the term that contributes to the sentence semantics on the sentence, document, and corpus levels rather than the traditional analysis of the document only. After extracting feature vector for each new document, feature selection is performed. It is then followed by K-Nearest Neighbour classification. The approach enhances the text classification accuracy.
Keywords :
data mining; learning (artificial intelligence); pattern classification; text analysis; corpus-based concept analysis; data mining model; document-based concept analysis; feature selection; feature vector extraction; information management task; information retrieval task; k-nearest neighbour classification; learning system; sentence semantics; sentence-based concept analysis; supervised technique; term analysis; text document classification; Accuracy; Analytical models; Classification algorithms; Data models; Feature extraction; Text categorization; Concept analysis; feature selection; feature vector; k-nearest neighbor; supervised; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computing, Communication and Applications (ICCCA), 2012 International Conference on
Conference_Location :
Dindigul, Tamilnadu
Print_ISBN :
978-1-4673-0270-8
Type :
conf
DOI :
10.1109/ICCCA.2012.6179179
Filename :
6179179
Link To Document :
بازگشت