DocumentCode
1930462
Title
An efficient approach to Web page classification using non-linear cellular automata
Author
Kundu, Anirban ; Roy, Debasis
Author_Institution
Netaji Subhash Eng. Coll., West Bengal Univ. of Technol., Kolkata, India
fYear
2010
fDate
28-30 Oct. 2010
Firstpage
313
Lastpage
318
Abstract
In this paper, we propose a Cellular Automata (CA) based implementation in classification for handling huge amount of data on the Web in an efficient way. We concentrate on Multiple Attractor Cellular Automata (MACA) as well as Single Cycle Multiple Attractor Cellular Automata (SMACA), since these are responsible for classifying various types of patterns. CA based implementation results in minimization of storage space needed for storing the downloaded Web pages within a Search Engine. In this paper we are going to use the 3-Neighborhood concept of CA for classification purpose. Efficiency of our approach lies within the usage of CA as a classifier in different forms. Experimental results demonstrate our approach with a higher efficiency level.
Keywords
cellular automata; pattern classification; search engines; Web page classification; multiple attractor cellular automata; nonlinear cellular automata; search engine; single cycle multiple attractor cellular automata; Automata; Crawlers; Grid computing; Indexes; Search engines; Transient analysis; Web pages; 3-neighborhood; CA; Cellular Automata based Classification; Classifier; Difference function; MACA; SMACA; Similarity function;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel Distributed and Grid Computing (PDGC), 2010 1st International Conference on
Conference_Location
Solan
Print_ISBN
978-1-4244-7675-6
Type
conf
DOI
10.1109/PDGC.2010.5679913
Filename
5679913
Link To Document