Title :
Impact of missing data in training artificial neural networks for computer-aided diagnosis
Author :
Markey, M.K. ; Patel, A.
Abstract :
Artificial neural networks (ANN) are frequently used in the development of Computer-Aided Diagnosis systems for breast cancer detection and diagnosis. One class of models uses descriptions of mammographic lesions encoded following the BI-RADS lexicon. Data sets that have been carefully curated to ensure completeness are generally used; however, in routine practice, some information is typically missing in clinical databases. The impact of missing data on the performance of a feedforward, back-propagation ANN, as measured by the area under the Receiver Operating Characteristic curve, was found to be much higher when data were missing from the testing set than when data were missing from the training set. This empirical study highlights the need for additional research on developing robust clinical decision support systems for realistic environments in which key information may be unknown or inaccessible.
Keywords :
Artificial neural networks; Breast cancer; Computer aided diagnosis; Data systems; Image databases; Intelligent networks; Lesions; Mammography; Pathology; Predictive models;
Conference_Titel :
Machine Learning and Applications, 2004. Proceedings. 2004 International Conference on
Conference_Location :
Louisville, Kentucky, USA
Print_ISBN :
0-7803-8823-2
DOI :
10.1109/ICMLA.2004.1383534