Title :
An efficient approach to Web page classification using non-linear cellular automata
Author :
Kundu, Anirban ; Roy, Debasis
Author_Institution :
Netaji Subhash Eng. Coll., West Bengal Univ. of Technol., Kolkata, India
Abstract :
In this paper, we propose a Cellular Automata (CA) based implementation in classification for handling huge amount of data on the Web in an efficient way. We concentrate on Multiple Attractor Cellular Automata (MACA) as well as Single Cycle Multiple Attractor Cellular Automata (SMACA), since these are responsible for classifying various types of patterns. CA based implementation results in minimization of storage space needed for storing the downloaded Web pages within a Search Engine. In this paper we are going to use the 3-Neighborhood concept of CA for classification purpose. Efficiency of our approach lies within the usage of CA as a classifier in different forms. Experimental results demonstrate our approach with a higher efficiency level.
Keywords :
cellular automata; pattern classification; search engines; Web page classification; multiple attractor cellular automata; nonlinear cellular automata; search engine; single cycle multiple attractor cellular automata; Automata; Crawlers; Grid computing; Indexes; Search engines; Transient analysis; Web pages; 3-neighborhood; CA; Cellular Automata based Classification; Classifier; Difference function; MACA; SMACA; Similarity function;
Conference_Titel :
Parallel Distributed and Grid Computing (PDGC), 2010 1st International Conference on
Conference_Location :
Solan
Print_ISBN :
978-1-4244-7675-6
DOI :
10.1109/PDGC.2010.5679913