Title of article :
An Efficient Data Preprocessing Procedure for Support Vector Clustering
Author/Authors :
Wang, Jeen-Shing National Cheng Kung University - Department of Electrical Engineering, Taiwan , Chiang, Jen-Chieh National Cheng Kung University - Department of Electrical Engineering, Taiwan
From page :
705
To page :
721
Abstract :
This paper presents an efficient data preprocessing procedure for the support of vector clustering (SVC) to reduce the size of a training dataset. Solving the optimization problem and labeling the data points with cluster labels are time-consuming in the SVC training procedure. This makes using SVC to process large datasets inefficient. We proposed a data preprocessing procedure to solve the problem. The procedure contains a shared nearest neighbor (SNN) algorithm, and utilizes the concept of unit vectors for eliminating insignificant data points from the dataset. Computer simulations have been conducted on artificial and benchmark datasets to demonstrate the effectiveness of the proposed method.
Keywords :
Support Vector Clustering , Shared Nearest Neighbors , Noise Elimination
Journal title :
Journal of J.UCS (Journal of Universal Computer Science)
Journal title :
Journal of J.UCS (Journal of Universal Computer Science)
Record number :
2661551
Link To Document :
بازگشت