مرکز منطقه ای اطلاع رساني علوم و فناوري

DocumentCode :

2373142

Title :

Further thoughts on precision

Author :

Gray, D. ; Bowes, D. ; Davey, N. ; Yi Sun ; Christianson, B.

Author_Institution :

Comput. Sci. Dept., Univ. of Hertfordshire, Hatfield, UK

fYear :

2011

fDate :

11-12 April 2011

Firstpage :

129

Lastpage :

133

Abstract :

Background: There has been much discussion amongst automated software defect prediction researchers regarding use of the precision and false positive rate classifier performance metrics. Aim: To demonstrate and explain why failing to report precision when using data with highly imbalanced class distributions may provide an overly optimistic view of classifier performance. Method: Well documented examples of how dependent class distribution affects the suitability of performance measures. Conclusions: When using data where the minority class represents less than around 5 to 10 percent of data points in total, failing to report precision may be a critical mistake. Furthermore, deriving the precision values omitted from studies can reveal valuable insight into true classifier performance.

Keywords :

data mining; learning (artificial intelligence); data mining; data points; false positive rate classifier; machine learning; metric performance; software defect automation;

fLanguage :

English

Publisher :

iet

Conference_Titel :

Evaluation & Assessment in Software Engineering (EASE 2011), 15th Annual Conference on

Conference_Location :

Durham

Electronic_ISBN :

978-1-84919-509-6

Type :

conf

DOI :

10.1049/ic.2011.0016

Filename :

6083171

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2373142