Segment comparison: how to interpret true negatives? Precision and recall

Bernard_Victor · February 9, 2025, 4:57pm

Operating system: MacOS 15.3
Slicer version: 5.8.0

Hello,
I’m new to 3DSlicer. I’d like to understand how to interpret true negatives in the ‘Segment comparison’ module. I compare the very close segmentation of a nerve on a ultrasound sequence that occupy about 5% of the screen, and true negative appear to be 25% while it should be around 95% (or a similar order of magnitude).
On the other side, can I use the TN, TP and FN % to calculate precision and recall?

Thank you very much.

Bernard

cpinter · February 11, 2025, 10:08am

For performance reasons, segments are stored only using their “effective extent” i.e. the region that contains non-background voxels. Considering this, the values seem correct. You can potentially extend the segments to the whole image (without checking, I think the best way to do this is to export the segmentation to labelmap using the image as a reference - in the Segmentations module - and then importing it back to segmentation). However, I find Dice a quite vague metric, due to its many dependencies (one is what you just demonstrated, another is that it works very differently with round shapes vs elongated shapes), and I recommend using Hausdorff instead.

Bernard_Victor · February 11, 2025, 12:39pm

Hello and thank you for your kind answer.
Could you please elaborate about non-background voxels? Or point to a tutorial that I could learn from?
Actually the most important is: do the percentages TP, FN and FP here correspond to true overlap of the 2 segments or segmentation without overlap for FN and FP. I want to evaluate segmentation of one user against another one here. So I’m in 2D. I’ll use the software later for 3D analysis and training.

I’m aware of the limitations of DSC. Thank you for the suggestion I’ll add the Hausdorff distance indeed. In nerve segmentation, clinicians (I’m one of them, an anesthesiolgist) like to stick with easy to understand metrics, so DSC is used a lot (a well as IoU). Even precision and recall are hardly mentionned.

Thank you very much in any case.

cpinter · February 11, 2025, 12:56pm

This is the paper about the segmentations infrastructure. A few things have changed since then, for example now we use layers for more efficient storage of segments, but the gist is the same.

Pinter, C., Lasso, A., & Fichtinger, G. (2019). Polymorph segmentation representation for medical image computing. Computer Methods and Programs in Biomedicine, 171, 19–26. Redirecting

I’m quite surprised, as I don’t think DSC is easier to understand than mean/max mm distances, and with elongated structures such as nerves it is borderline useless (1mm difference with a 1mm diameter structure results in a score of 0).

Bernard_Victor · February 12, 2025, 6:43pm

Thank you very much for your kind help and for your advice.
The nerves we study are bigger nerve (5-15 mm diameter). But yes DSC has many problems, including not distinguishing between FP and FN.
I will definitely use also Hausdorf distance in my work too thanks to your knowledgable advice.

Topic		Replies	Views
Dice Similarity compare segment TP and TN correct? Support segmentation	3	445	April 23, 2021
Segment comparison returns 0 for all metrics Development slicerrt	15	667	July 5, 2019
Hausdorff distance calculation in SegmentComparison module Support slicerrt	20	5792	October 10, 2018
Auomate DICE metric calculation Support	1	413	December 10, 2021
Compute Dice score, Hausdorff distance for Multi-Region segmentation Support segmentation	9	1890	May 10, 2024

Segment comparison: how to interpret true negatives? Precision and recall

Related topics