Title :
Model evaluation of datasets using critical dimension model invariants
Author :
Suryakumar, Divya ; Sung, Andrew H. ; Mazumdar, Subhra ; Liu, Quanwei
Author_Institution :
Dept. of Comput. Sci. & Eng., New Mexico Inst. of Min. & Technol., Socorro, TX, USA
Abstract :
Critical dimension is the minimum number of features that is required to ensure the performance of a learning machine to be “high”. This critical dimension is usually unique to the learning machine and the ranking algorithm combination. Medical- and bio-informatics datasets are different from most other datasets in that there is an imbalance in most of these datasets and a high prediction accuracy often depends upon not just the overall accuracy but also the true positive and the false negative rates. To find a medically and bio-informatically accurate critical dimension and for better analysis of such datasets we develop two evaluation models, one using all features and the other using critical number of features. The performance measurements such as accuracy, specificity, sensitivity, area under the curve, F-score and kappa values are compared. This paper shows that at the critical dimension the evaluation model shows good results for all performance measurements measured on most datasets studied. The difference in performance measurements obtained using only critical number and using all features is significantly less, i.e., there is not much difference in sensitivity, specificity and other measurements calculated.
Keywords :
bioinformatics; data mining; learning (artificial intelligence); bio-informatic datasets; bio-informatically accurate critical dimension; critical dimension model invariants; dataset model evaluation; false negative rates; learning machine performance; medical datasets; medically accurate critical dimension; performance measurements; prediction accuracy; ranking algorithm; true positive rates; Decision support systems; High definition video; Intelligent systems; Critical dimension; feature reduction; sensitivity and specificity;
Conference_Titel :
Intelligent Systems Design and Applications (ISDA), 2012 12th International Conference on
Conference_Location :
Kochi
Print_ISBN :
978-1-4673-5117-1
DOI :
10.1109/ISDA.2012.6416629