Title :
Pattern mining based on local distribution
Author :
Yu, Zhiwen ; Wang, Xing ; Wong, Hau-San ; Deng, Zhongkai
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Hong Kong
Abstract :
Pattern mining gains more and more attention due to its useful applications in many areas, such as machine learning, database, multimedia, biology, and so on. Though there exist a lot of approaches for pattern mining, few of them consider the local distribution of the data. In the paper, we not only design six challenge datasets related to the local patterns, but also propose a new pattern mining algorithm based on local distribution. Unlike traditional pattern mining algorithms, our new algorithm first creates a local distribution for each data point by a random approach. Then, the distribution curve of each data point is simulated by the sum of low frequency curves obtained by the wavelet approach. In the third step, the coefficients of these low frequency curves for each data point are clustered by the normalized cut approach. Finally, the patterns of the datasets are obtained by the new pattern mining algorithm. The experiments show that our new algorithm outperforms traditional unsupervised learning approaches, such as K-means, EM, spectral clustering algorithm (SCA), and so on, on these six new datasets.
Keywords :
data mining; pattern clustering; wavelet transforms; distribution curve; frequency curve; local data distribution; local distribution clustering; normalized cut; pattern mining; wavelet approach; Clustering algorithms; Frequency; Machine learning; Multimedia databases; Partitioning algorithms; Supervised learning; Support vector machine classification; Support vector machines; Training data; Unsupervised learning;
Conference_Titel :
Neural Networks, 2008. IJCNN 2008. (IEEE World Congress on Computational Intelligence). IEEE International Joint Conference on
Conference_Location :
Hong Kong
Print_ISBN :
978-1-4244-1820-6
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2008.4633852