DocumentCode
3165903
Title
Automatic classification of Web information based on site structure
Author
Kening, Gao ; Leiming, Yang ; Bin, Zhang ; Qiaozi, Chai ; Anxiang, Ma
Author_Institution
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang
fYear
2005
fDate
23-25 Nov. 2005
Lastpage
558
Abstract
How to classify automatically Web information that grows explosive is becoming an imminent problem needed to be resolved. Based on site structure, we propose, in this paper, a new mechanism of automatic classification of Web information, which downloads Web pages within a Web site, records the hyperlinks among Web pages, catches the site structure, extracts the classifying system of the site itself, and then links categorizing information with the correspondent position in the site structure. Therefore automatic classification of Web information can be realized through matching the positions of categorizing information with the positions of Web pages. Experiments show that such classification based on site structure works more accurately and efficiently
Keywords
Web sites; classification; Web information automatic classification; Web page; Web site structure; Classification tree analysis; Data mining; Educational institutions; Explosives; Information science; Information systems; Machine learning; Navigation; Statistics; Web pages;
fLanguage
English
Publisher
ieee
Conference_Titel
Cyberworlds, 2005. International Conference on
Conference_Location
Singapore
Print_ISBN
0-7695-2378-1
Type
conf
DOI
10.1109/CW.2005.24
Filename
1587594
Link To Document