• DocumentCode
    174536
  • Title

    Improved document classification through enhanced Naive Bayes algorithm

  • Author

    Sathyadevan, Shiju ; Athira, U. ; Sarath, P.R. ; Anjana, V.

  • Author_Institution
    Amrita Cybersecurity Centre, Amrita Vishwa Vidyapeetham, Kollam, India
  • fYear
    2014
  • fDate
    26-28 Aug. 2014
  • Firstpage
    100
  • Lastpage
    104
  • Abstract
    Immense growth in communication has paved way for existence of information across the world in wide separated zones. There exists a need for a mechanism to render apt information to the needy from this enormous source of information. This mechanism is of high demand for educational purposes. Knowledge based cloud (Kloud) proposes a solution to combine together the information in different area, which is managed by several organizations. It then organizes them into different sections and hence providing a platform to furnish relevant information to people in search of it. The paper discusses about a method based on Naive Bayes algorithm to classify documents pushed into "Kloud". A variation to this algorithm has been implemented by calculating term weight using "converged weight" method resulting in better accuracy and speed. A comparative study of proposed variance in classification algorithm against the actual algorithm was performed. Further we also implemented two subclassification algorithms namely hierarchical subclassification and subcategorization using document similarity method.
  • Keywords
    Bayes methods; cloud computing; document handling; knowledge based systems; pattern classification; Kloud; converged weight method; document classification improvement; document similarity method; educational purposes; enhanced naive Bayes algorithm; hierarchical subclassification algorithm; information organization; information rendering; information search; information source; knowledge based cloud; subcategorization algorithm; term weight; Accuracy; Classification algorithms; Computer security; Equations; Support vector machine classification; Text categorization; Training; Converged weight; Document classification; Naïve bayes; Term weight; Word vector;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Science & Engineering (ICDSE), 2014 International Conference on
  • Conference_Location
    Kochi
  • Print_ISBN
    978-1-4799-6870-1
  • Type

    conf

  • DOI
    10.1109/ICDSE.2014.6974619
  • Filename
    6974619