DocumentCode :
3399163
Title :
Building clusters with distributed features for text classification using KNN
Author :
Wajeed, Mohammed Abdul ; Adilakshmi, T.
Author_Institution :
SCSI, Sreenidhi Inst. of Sci. & Technol., Hyderabad, India
fYear :
2012
fDate :
10-12 Jan. 2012
Firstpage :
1
Lastpage :
6
Abstract :
Bulk data is generated in the era of Information Technology. If it is not stored in a properly systematic manner then the generated data cannot be reused. This is because navigation becomes if not impossible, certainly very difficult. So we classify the data before it is stored. Present paper explores the techniques to store the data in a supervised classification paradigm using distributed features. Initially Soft, hard and mixed Clusters are build based on the distributed features later the clusters are used to classify the documents based on the K-nearest neighbour classification algorithm.
Keywords :
learning (artificial intelligence); pattern classification; pattern clustering; text analysis; K-nearest neighbour classification algorithm; KNN; bulk data; distributed features; information technology; supervised classification paradigm; text classification; Accuracy; Equations; Informatics; Text categorization; Training; Training data; Vectors; distributed features; knn-classifier; soft-hard clusters; text classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Communication and Informatics (ICCCI), 2012 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4577-1580-8
Type :
conf
DOI :
10.1109/ICCCI.2012.6158839
Filename :
6158839
Link To Document :
بازگشت