DocumentCode :
1375469
Title :
Automated Diagnosis of Diseases Based on Classification: Dynamic Determination of the Number of Trees in Random Forests Algorithm
Author :
Tripoliti, Evanthia E. ; Fotiadis, Dimitrios I. ; Manis, George
Author_Institution :
Dept. of Comput. Sci., Univ. of Ioannina, Ioannina, Greece
Volume :
16
Issue :
4
fYear :
2012
fDate :
7/1/2012 12:00:00 AM
Firstpage :
615
Lastpage :
622
Abstract :
The accurate diagnosis of diseases with high prevalence rate, such as Alzheimer, Parkinson, diabetes, breast cancer, and heart diseases, is one of the most important biomedical problems whose administration is imperative. In this paper, we present a new method for the automated diagnosis of diseases based on the improvement of random forests classification algorithm. More specifically, the dynamic determination of the optimum number of base classifiers composing the random forests is addressed. The proposed method is different from most of the methods reported in the literature, which follow an overproduce-and-choose strategy, where the members of the ensemble are selected from a pool of classifiers, which is known a priori. In our case, the number of classifiers is determined during the growing procedure of the forest. Additionally, the proposed method produces an ensemble not only accurate, but also diverse, ensuring the two important properties that should characterize an ensemble classifier. The method is based on an online fitting procedure and it is evaluated using eight biomedical datasets and five versions of the random forests algorithm (40 cases). The method decided correctly the number of trees in 90% of the test cases.
Keywords :
biomedical MRI; cancer; cardiology; image classification; medical image processing; randomised algorithms; trees (mathematics); Alzheimer diseases; Parkinson diseases; automated disease diagnosis; base classifiers; biomedical datasets; biomedical problems; breast cancer; diabetes; dynamic trees determination; fMRI; heart diseases; online fitting procedure; overproduce-and-choose strategy; random forests classification algorithm; Accuracy; Classification algorithms; Correlation; Diseases; Fitting; Radio frequency; Vegetation; Classification of diseases; random forests; Algorithms; Databases, Factual; Decision Trees; Diagnosis, Computer-Assisted; Disease; Humans;
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2011.2175938
Filename :
6080732
Link To Document :
بازگشت