Title :
Melanoma prediction using data mining system LERS
Author :
Grzymala-Busse, Jan P. ; Grzymala-Busse, Jerzy W. ; Hippe, Zdzislaw S.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Kansas Univ., Lawrence, KS, USA
fDate :
6/23/1905 12:00:00 AM
Abstract :
One of the important tools for early diagnosis of malignant melanoma is the total dermatoscopy score (TDS), computed using the ABCD (asymmetry, border, color, diameter) formula. Our primary objective was to check whether the ABCD formula is optimal. Using a data set containing 276 cases of melanoma and the LERS (Learning from Examples based on Rough Sets) data mining system, we checked more than 20,000 modified formulas for ABCD, computing the predicted error rate of melanoma diagnosis using 10-fold cross-validation for every modified formula. As a result, we found the optimal ABCD formula for our setup: discretization based on cluster analysis, the LEM2 (Learning from Examples Module, version 2) algorithm (one of the four LERS algorithms for rule induction) and the standard LERS classification scheme. The error rate for the standard ABCD formula was 10.21 %, while for the optimal ABCD formula the error rate was reduced to 6.04%. Some research in melanoma diagnosis shows that the use of the ABCD formula does not improve the error rate. Our research shows that the ABCD formula is useful, since, for our data set, the error rate without the use of the ABCD formula was higher (13.73%)
Keywords :
cancer; data mining; image classification; learning by example; medical diagnostic computing; medical expert systems; medical image processing; pattern clustering; skin; ABCD formula optimality; LEM2 algorithm; LERS classification scheme; LERS data mining system; asymmetry; border; cluster analysis; color; cross-validation; diameter; discretization; error rate prediction; learning from examples; malignant melanoma diagnosis; melanoma prediction; rough sets; rule induction; skin cancer; total dermatoscopy score; Algorithm design and analysis; Cancer; Classification algorithms; Clustering algorithms; Computer science; Data mining; Error analysis; Lesions; Malignant tumors; Systems engineering and theory;
Conference_Titel :
Computer Software and Applications Conference, 2001. COMPSAC 2001. 25th Annual International
Conference_Location :
Chicago, IL
Print_ISBN :
0-7695-1372-7
DOI :
10.1109/CMPSAC.2001.960676