Generalized overlap measures for evaluation and validation in medical image analysis

IEEE Trans Med Imaging. 2006 Nov;25(11):1451-61. doi: 10.1109/TMI.2006.880587.

Abstract

Measures of overlap of labelled regions of images, such as the Dice and Tanimoto coefficients, have been extensively used to evaluate image registration and segmentation algorithms. Modern studies can include multiple labels defined on multiple images yet most evaluation schemes report one overlap per labelled region, simply averaged over multiple images. In this paper, common overlap measures are generalized to measure the total overlap of ensembles of labels defined on multiple test images and account for fractional labels using fuzzy set theory. This framework allows a single "figure-of-merit" to be reported which summarises the results of a complex experiment by image pair, by label or overall. A complementary measure of error, the overlap distance, is defined which captures the spatial extent of the nonoverlapping part and is related to the Hausdorff distance computed on grey level images. The generalized overlap measures are validated on synthetic images for which the overlap can be computed analytically and used as similarity measures in nonrigid registration of three-dimensional magnetic resonance imaging (MRI) brain images. Finally, a pragmatic segmentation ground truth is constructed by registering a magnetic resonance atlas brain to 20 individual scans, and used with the overlap measures to evaluate publicly available brain segmentation algorithms.

Publication types

  • Evaluation Study
  • Validation Study

MeSH terms

  • Algorithms*
  • Brain / anatomy & histology*
  • Humans
  • Image Enhancement / methods*
  • Image Interpretation, Computer-Assisted / methods*
  • Information Storage and Retrieval / methods
  • Magnetic Resonance Imaging / methods*
  • Magnetic Resonance Imaging / standards
  • Pattern Recognition, Automated / methods*
  • Pattern Recognition, Automated / standards
  • Quality Assurance, Health Care / methods
  • Quality Assurance, Health Care / standards
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Subtraction Technique*