DocumentCode
441861
Title
Application of layered clustering and plane partition in Web page classification
Author
Wang, Li-Xia ; Han, Jian-Min ; Wei, Zhe ; Zhou, Guang-Cheng
Author_Institution
Coll. of Inf. Sci. & Eng., Zhejiang Normal Univ., Jinhua, China
Volume
4
fYear
2005
fDate
18-21 Aug. 2005
Firstpage
2325
Abstract
The layered clustering can create layered nesting class with high precision. But the computing complexity is relatively high so that it is not fitted to solve large amount of sample calculation problems. K-means method has high efficiency while it is affected easily by the choice of the center of initial clustering. So to the irregular distributed samples, the clustering effect is usually not good. The paper focuses on the distribution features and complexities of samples in Web page classification and puts forward a classification method - to combine the layered clustering and plane partition. Firstly we use the algorithm of layered clustering in a few of samples to generate original clustering centers of sample set. Secondly, K-means method is used to classify the whole samples set. This strategy not only takes full advantage of the high efficiency of the K-means algorithm but also makes good use of the high precision and reliability of layered clustering method. Finally, this paper apply the method of combining the layered clustering and plane partition to solving the problems of text classification and presents some experimental results.
Keywords
Internet; classification; computational complexity; data mining; text analysis; K-means method; Web page classification; distribution feature; layered clustering method; plane partition; text classification; Clustering algorithms; Clustering methods; Data mining; Educational institutions; Electronic mail; Information science; Internet; Partitioning algorithms; Text categorization; Web pages; K-means; Text clustering; Web mining; layered clustering;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location
Guangzhou, China
Print_ISBN
0-7803-9091-1
Type
conf
DOI
10.1109/ICMLC.2005.1527332
Filename
1527332
Link To Document