Title :
Ensemble learning approach in improved K Nearest Neighbor algorithm for Text categorization
Author :
Iswarya, P. ; Radha, V.
Author_Institution :
Dept. of Comput. Sci., Avinashilingam Inst. for Home Sci. & Higher Educ. for Women, Coimbatore, India
Abstract :
Due to the tremendous growth of digital content in World Wide Web (WWW), Text categorization has become an important tool to manage and organize text related data. This paper proposes an Ensemble Learning approach in Improved K Nearest Neighbor algorithm for Text Categorization (EINNTC), which consists of single pass clustering, Ensemble learning and KNN algorithm. The EINNTC method provides solution to traditional KNN classifier issues, by reducing the huge text similarity computation complexity, avoids an impact of noisy training sample, and expediting the process of finding K nearest neighbors. The experiments were carried out with standard benchmark Reuters dataset, and their empirical results shows that the proposed method outperforms the SVM and KNN classifiers.
Keywords :
Internet; computational complexity; learning (artificial intelligence); pattern classification; text analysis; EINNTC; EINNTC method; KNN classifier algorithm; SVM classifiers; WWW; World Wide Web; ensemble learning approach in improved k nearest neighbor algorithm for text categorization; noisy training sample; standard benchmark Reuters dataset; text related data management; text related data organization; text similarity computation complexity; Classification algorithms; Clustering algorithms; Conferences; Support vector machines; Technological innovation; Text categorization; Training; Categorization; Clustering; Ensemble; K nearest neighbor; Support vector machine;
Conference_Titel :
Innovations in Information, Embedded and Communication Systems (ICIIECS), 2015 International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4799-6817-6
DOI :
10.1109/ICIIECS.2015.7193250