DocumentCode :
2373142
Title :
Further thoughts on precision
Author :
Gray, D. ; Bowes, D. ; Davey, N. ; Yi Sun ; Christianson, B.
Author_Institution :
Comput. Sci. Dept., Univ. of Hertfordshire, Hatfield, UK
fYear :
2011
fDate :
11-12 April 2011
Firstpage :
129
Lastpage :
133
Abstract :
Background: There has been much discussion amongst automated software defect prediction researchers regarding use of the precision and false positive rate classifier performance metrics. Aim: To demonstrate and explain why failing to report precision when using data with highly imbalanced class distributions may provide an overly optimistic view of classifier performance. Method: Well documented examples of how dependent class distribution affects the suitability of performance measures. Conclusions: When using data where the minority class represents less than around 5 to 10 percent of data points in total, failing to report precision may be a critical mistake. Furthermore, deriving the precision values omitted from studies can reveal valuable insight into true classifier performance.
Keywords :
data mining; learning (artificial intelligence); data mining; data points; false positive rate classifier; machine learning; metric performance; software defect automation;
fLanguage :
English
Publisher :
iet
Conference_Titel :
Evaluation & Assessment in Software Engineering (EASE 2011), 15th Annual Conference on
Conference_Location :
Durham
Electronic_ISBN :
978-1-84919-509-6
Type :
conf
DOI :
10.1049/ic.2011.0016
Filename :
6083171
Link To Document :
بازگشت