• DocumentCode
    1930462
  • Title

    An efficient approach to Web page classification using non-linear cellular automata

  • Author

    Kundu, Anirban ; Roy, Debasis

  • Author_Institution
    Netaji Subhash Eng. Coll., West Bengal Univ. of Technol., Kolkata, India
  • fYear
    2010
  • fDate
    28-30 Oct. 2010
  • Firstpage
    313
  • Lastpage
    318
  • Abstract
    In this paper, we propose a Cellular Automata (CA) based implementation in classification for handling huge amount of data on the Web in an efficient way. We concentrate on Multiple Attractor Cellular Automata (MACA) as well as Single Cycle Multiple Attractor Cellular Automata (SMACA), since these are responsible for classifying various types of patterns. CA based implementation results in minimization of storage space needed for storing the downloaded Web pages within a Search Engine. In this paper we are going to use the 3-Neighborhood concept of CA for classification purpose. Efficiency of our approach lies within the usage of CA as a classifier in different forms. Experimental results demonstrate our approach with a higher efficiency level.
  • Keywords
    cellular automata; pattern classification; search engines; Web page classification; multiple attractor cellular automata; nonlinear cellular automata; search engine; single cycle multiple attractor cellular automata; Automata; Crawlers; Grid computing; Indexes; Search engines; Transient analysis; Web pages; 3-neighborhood; CA; Cellular Automata based Classification; Classifier; Difference function; MACA; SMACA; Similarity function;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel Distributed and Grid Computing (PDGC), 2010 1st International Conference on
  • Conference_Location
    Solan
  • Print_ISBN
    978-1-4244-7675-6
  • Type

    conf

  • DOI
    10.1109/PDGC.2010.5679913
  • Filename
    5679913