DocumentCode :
441861
Title :
Application of layered clustering and plane partition in Web page classification
Author :
Wang, Li-Xia ; Han, Jian-Min ; Wei, Zhe ; Zhou, Guang-Cheng
Author_Institution :
Coll. of Inf. Sci. & Eng., Zhejiang Normal Univ., Jinhua, China
Volume :
4
fYear :
2005
fDate :
18-21 Aug. 2005
Firstpage :
2325
Abstract :
The layered clustering can create layered nesting class with high precision. But the computing complexity is relatively high so that it is not fitted to solve large amount of sample calculation problems. K-means method has high efficiency while it is affected easily by the choice of the center of initial clustering. So to the irregular distributed samples, the clustering effect is usually not good. The paper focuses on the distribution features and complexities of samples in Web page classification and puts forward a classification method - to combine the layered clustering and plane partition. Firstly we use the algorithm of layered clustering in a few of samples to generate original clustering centers of sample set. Secondly, K-means method is used to classify the whole samples set. This strategy not only takes full advantage of the high efficiency of the K-means algorithm but also makes good use of the high precision and reliability of layered clustering method. Finally, this paper apply the method of combining the layered clustering and plane partition to solving the problems of text classification and presents some experimental results.
Keywords :
Internet; classification; computational complexity; data mining; text analysis; K-means method; Web page classification; distribution feature; layered clustering method; plane partition; text classification; Clustering algorithms; Clustering methods; Data mining; Educational institutions; Electronic mail; Information science; Internet; Partitioning algorithms; Text categorization; Web pages; K-means; Text clustering; Web mining; layered clustering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
Type :
conf
DOI :
10.1109/ICMLC.2005.1527332
Filename :
1527332
Link To Document :
بازگشت