DocumentCode :
11012
Title :
Information theory, kelly betting, risk, reward, commission, and omission: An example problem in breast cancer
Author :
Dalton, Leslie W.
Author_Institution :
Private Practice, Austin, TX, USA
Volume :
2
fYear :
2014
fDate :
2014
Firstpage :
1272
Lastpage :
1280
Abstract :
In binary classification, two-way confusion matrices, with corresponding measures, such as sensitivity and specificity, have become so ubiquitous that those who review results may not realize there are other and more realistic ways to visualize data. This is, particularly, true when risk and reward considerations are important. The approach suggested here proposes that classification need not offer a conclusion on every instance within a data set. If an algorithm finds instances (e.g., patient cases in a medical data set) in which attributes pertaining to a patient´s disease offer zero to nil information, there should be no classification offered. From the physician´s perspective, disclosure of nil information should be welcome because it might prevent potentially harmful treatment. It follows from this that the developer of a classifier can provide summary results amendable for helping the consumer decide whether or not it is prudent to pass or act (commission versus omission). It is not always about balancing sensitivity and specificity in all cases, but optimizing action on some cases. The explanation is centered on John Kelly´s link of gambling with Shannon information theory. In addition, Graham´s margin of safety, Bernoulli´s utiles, and Hippocratic Oath are important. An example problem is provided using a Netherlands Cancer Institute breast cancer data set. Recurrence score, a popular molecular-based assay for breast cancer prognosis, was found to have an uninformative zone. The uninformative subset had been grouped with positive results to garner higher sensitivity. Yet, because of a positive result, patients might be advised to undergo potentially harmful treatment in the absence of useful information.
Keywords :
cancer; information theory; medical diagnostic computing; molecular biophysics; patient diagnosis; Bernoulli utiles; Graham margin of safety; Kelly Betting; Shannon information theory; binary classification; breast cancer; breast cancer prognosis; hippocratic oath; medical data set; molecular-based assay; patient disease; recurrence score; reward considerations; risk considerations; sensitivity; specificity; two-way confusion matrices; Breast cancer; Cancer; Classification; Data visualization; Information theory; Safety; Sensitivity and specificity; Shannon theory; Data analysis; Kelly; betting; breast; cancer; clinical diagnosis; criterion; data compression; entropy; genetic expression; information; mutual; recurrence; sensitivity and specificity; theory;
fLanguage :
English
Journal_Title :
Access, IEEE
Publisher :
ieee
ISSN :
2169-3536
Type :
jour
DOI :
10.1109/ACCESS.2014.2363134
Filename :
6936324
Link To Document :
بازگشت