Title :
Building clusters with distributed features for text classification using KNN
Author :
Wajeed, Mohammed Abdul ; Adilakshmi, T.
Author_Institution :
SCSI, Sreenidhi Inst. of Sci. & Technol., Hyderabad, India
Abstract :
Bulk data is generated in the era of Information Technology. If it is not stored in a properly systematic manner then the generated data cannot be reused. This is because navigation becomes if not impossible, certainly very difficult. So we classify the data before it is stored. Present paper explores the techniques to store the data in a supervised classification paradigm using distributed features. Initially Soft, hard and mixed Clusters are build based on the distributed features later the clusters are used to classify the documents based on the K-nearest neighbour classification algorithm.
Keywords :
learning (artificial intelligence); pattern classification; pattern clustering; text analysis; K-nearest neighbour classification algorithm; KNN; bulk data; distributed features; information technology; supervised classification paradigm; text classification; Accuracy; Equations; Informatics; Text categorization; Training; Training data; Vectors; distributed features; knn-classifier; soft-hard clusters; text classification;
Conference_Titel :
Computer Communication and Informatics (ICCCI), 2012 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4577-1580-8
DOI :
10.1109/ICCCI.2012.6158839