Title :
An outlier detection algorithm based on object-oriented metrics thresholds
Author :
Alan, Oral ; Catal, Cagatay
Author_Institution :
Inf. Technol. Inst., TUBITAK-Marmara Res. Center, Kocaeli, Turkey
Abstract :
Detection of outliers in software measurement datasets is a critical issue that affects the performance of software fault prediction models built based on these datasets. Two necessary components of fault prediction models, software metrics and fault data, are collected from the software projects developed with object-oriented programming paradigm. We proposed an outlier detection algorithm based on these kinds of metrics thresholds. We used Random Forests machine learning classifier on two software measurement datasets collected from jEdit open-source text editor project and experiments revealed that our outlier detection approach improves the performance of fault predictors based on Random Forests classifier.
Keywords :
learning (artificial intelligence); object-oriented programming; pattern classification; public domain software; software fault tolerance; software metrics; software performance evaluation; text editing; classifier; jEdit open-source text editor; machine learning; object-oriented metrics thresholds; object-oriented programming; outlier detection; random forests; software fault prediction; software measurement datasets; software metrics; Detection algorithms; Fault detection; Machine learning; Object oriented modeling; Object oriented programming; Open source software; Predictive models; Software measurement; Software metrics; Software performance; metrics thresholds; object-oriented metrics; outlier detection; software fault prediction;
Conference_Titel :
Computer and Information Sciences, 2009. ISCIS 2009. 24th International Symposium on
Conference_Location :
Guzelyurt
Print_ISBN :
978-1-4244-5021-3
Electronic_ISBN :
978-1-4244-5023-7
DOI :
10.1109/ISCIS.2009.5291882