DocumentCode :
1430855
Title :
Estimation of Inferential Uncertainty in Assessing Expert Segmentation Performance From STAPLE
Author :
Commowick, Olivier ; Warfield, Simon K.
Author_Institution :
Dept. of Radiol., Comput. Radiol. Lab., Boston, MA, USA
Volume :
29
Issue :
3
fYear :
2010
fDate :
3/1/2010 12:00:00 AM
Firstpage :
771
Lastpage :
780
Abstract :
The evaluation of the quality of segmentations of an image, and the assessment of intra- and inter-expert variability in segmentation performance, has long been recognized as a difficult task. For a segmentation validation task, it may be effective to compare the results of an automatic segmentation algorithm to multiple expert segmentations. Recently an expectation-maximization (EM) algorithm for simultaneous truth and performance level estimation (STAPLE) was developed to this end to compute both an estimate of the reference standard segmentation and performance parameters from a set of segmentations of an image. The performance is characterized by the rate of detection of each segmentation label by each expert in comparison to the estimated reference standard. This previous work provides estimates of performance parameters, but does not provide any information regarding the uncertainty of the estimated values. An estimate of this inferential uncertainty, if available, would allow the estimation of confidence intervals for the values of the parameters. This would facilitate the interpretation of the performance of segmentation generators and help determine if sufficient data size and number of segmentations have been obtained to precisely characterize the performance parameters. We present a new algorithm to estimate the inferential uncertainty of the performance parameters for binary and multicategory segmentations. It is derived for the special case of the STAPLE algorithm based on established theory for general purpose covariance matrix estimation for EM algorithms. The bounds on the performance parameters are estimated by the computation of the observed information matrix. We use this algorithm to study the bounds on performance parameters estimates from simulated images with specified performance parameters, and from interactive segmentations of neonatal brain MRIs. We demonstrate that confidence intervals for expert segmentation performance parameters can b- estimated with our algorithm. We investigate the influence of the number of experts and of the segmented data size on these bounds, showing that it is possible to determine the number of image segmentations and the size of images necessary to achieve a chosen level of accuracy in segmentation performance assessment.
Keywords :
biomedical MRI; brain; covariance matrices; expectation-maximisation algorithm; image segmentation; medical expert systems; medical image processing; STAPLE; automatic segmentation algorithm; binary segmentation; covariance matrix; expectation-maximization algorithm; expert segmentation performance; inferential uncertainty; neonatal brain MRI; reference standard segmentation; segmented data size; simultaneous truth performance level estimation; Brain modeling; Character generation; Computational modeling; Covariance matrix; Image recognition; Image segmentation; Parameter estimation; Pediatrics; Standards development; Uncertainty; Confidence intervals; covariance matrix; expectation-maximization (EM); information matrix; simultaneous truth and performance level estimation (STAPLE); validation; Algorithms; Brain; Computer Simulation; Confidence Intervals; Databases, Factual; Humans; Image Processing, Computer-Assisted; Infant, Newborn; Magnetic Resonance Imaging; Reproducibility of Results;
fLanguage :
English
Journal_Title :
Medical Imaging, IEEE Transactions on
Publisher :
ieee
ISSN :
0278-0062
Type :
jour
DOI :
10.1109/TMI.2009.2036011
Filename :
5423294
Link To Document :
بازگشت