Title :
Species distribution modeling and prediction: A class imbalance problem
Author :
Johnson, R.A. ; Chawla, Nitesh V. ; Hellmann, J.J.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Notre Dame, Notre Dame, IN, USA
Abstract :
Predicting the distributions of species is central to a variety of applications in ecology and conservation biology. With increasing interest in using electronic occurrence records, many modeling techniques have been developed to utilize this data and compute the potential distribution of species as a proxy for actual observations. As the actual observations are typically overwhelmed by non-occurrences, we approach the modeling of species´ distributions with a focus on the problem of class imbalance. Our analysis includes the evaluation of several machine learning methods that have been shown to address the problems of class imbalance, but which have rarely or never been applied to the domain of species distribution modeling. Evaluation of these methods includes the use of the area under the precision-recall curve (AUPR), which can supplement other metrics to provide a more informative assessment of model utility under conditions of class imbalance. Our analysis concludes that emphasizing techniques that specifically address the problem of class imbalance can provide AUROC and AUPR results competitive with traditional species distribution models.
Keywords :
biology computing; ecology; learning (artificial intelligence); pattern classification; area under the precision-recall curve; area under the receiver operating characteristic curve; class imbalance problem; conservation biology; ecology; electronic occurrence records; machine learning methods; species distribution modeling; Biological system modeling; Computational modeling; Decision trees; Measurement; Niobium; Predictive models; Radio frequency;
Conference_Titel :
Intelligent Data Understanding (CIDU), 2012 Conference on
Conference_Location :
Boulder, CO
Print_ISBN :
978-1-4673-4625-2
DOI :
10.1109/CIDU.2012.6382186