DocumentCode
174536
Title
Improved document classification through enhanced Naive Bayes algorithm
Author
Sathyadevan, Shiju ; Athira, U. ; Sarath, P.R. ; Anjana, V.
Author_Institution
Amrita Cybersecurity Centre, Amrita Vishwa Vidyapeetham, Kollam, India
fYear
2014
fDate
26-28 Aug. 2014
Firstpage
100
Lastpage
104
Abstract
Immense growth in communication has paved way for existence of information across the world in wide separated zones. There exists a need for a mechanism to render apt information to the needy from this enormous source of information. This mechanism is of high demand for educational purposes. Knowledge based cloud (Kloud) proposes a solution to combine together the information in different area, which is managed by several organizations. It then organizes them into different sections and hence providing a platform to furnish relevant information to people in search of it. The paper discusses about a method based on Naive Bayes algorithm to classify documents pushed into "Kloud". A variation to this algorithm has been implemented by calculating term weight using "converged weight" method resulting in better accuracy and speed. A comparative study of proposed variance in classification algorithm against the actual algorithm was performed. Further we also implemented two subclassification algorithms namely hierarchical subclassification and subcategorization using document similarity method.
Keywords
Bayes methods; cloud computing; document handling; knowledge based systems; pattern classification; Kloud; converged weight method; document classification improvement; document similarity method; educational purposes; enhanced naive Bayes algorithm; hierarchical subclassification algorithm; information organization; information rendering; information search; information source; knowledge based cloud; subcategorization algorithm; term weight; Accuracy; Classification algorithms; Computer security; Equations; Support vector machine classification; Text categorization; Training; Converged weight; Document classification; Naïve bayes; Term weight; Word vector;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Science & Engineering (ICDSE), 2014 International Conference on
Conference_Location
Kochi
Print_ISBN
978-1-4799-6870-1
Type
conf
DOI
10.1109/ICDSE.2014.6974619
Filename
6974619
Link To Document