Title :
Semi-supervised k-means clustering for outlier detection in mammogram classification
Author :
Thangavel, K. ; Mohideen, A.K.
Author_Institution :
Dept. of Comput. Sci., Periyar Univ., Salem, India
Abstract :
Detection of outliers and relevant features are the most important process before classification. In this paper, a novel semi-supervised k-means clustering is proposed for outlier detection in mammogram classification. Initially the shape features are extracted from the digital mammograms, and k-means clustering is applied to cluster the features, the number of clusters is equal with the number of classes. The clusters are compared with original classes, the wrongly clustered instances are identified as outliers and they are removed from the feature space. A novel Genetic Association Rule Miner (GARM) is applied with this reduced feature set to construct the association rules for classification. The performance is analyzed with rough set using Receiver Operating Characteristic (ROC) curve analysis. The mammogram images from MIAS (Mammogram Image Analysis Society) and DDSM (Digital Database for Screening Mammography) were used to evaluate the performance.
Keywords :
data mining; feature extraction; image classification; mammography; medical image processing; pattern clustering; radiology; shape recognition; DDSM; GARM; MIAS; ROC; digital database for screening mammography; digital mammogram image; genetic association rule miner; mammogram classification; mammogram image analysis society; outlier detection; radiology method; receiver operating characteristic curve analysis; semisupervised k-means clustering; shape feature extraction; Accuracy; Delta-sigma modulation; Feature extraction; Image segmentation; Lesions; Pixel; Shape; Mammogram; Outlier Detection; Shape Features; k-Means Clustering;
Conference_Titel :
Trendz in Information Sciences & Computing (TISC), 2010
Conference_Location :
Chennai
Print_ISBN :
978-1-4244-9007-3
DOI :
10.1109/TISC.2010.5714611