Segment comparison: how to interpret true negatives? Precision and recall

Operating system: MacOS 15.3
Slicer version: 5.8.0

Hello,
I’m new to 3DSlicer. I’d like to understand how to interpret true negatives in the ‘Segment comparison’ module. I compare the very close segmentation of a nerve on a ultrasound sequence that occupy about 5% of the screen, and true negative appear to be 25% while it should be around 95% (or a similar order of magnitude).
On the other side, can I use the TN, TP and FN % to calculate precision and recall?

Thank you very much.

Bernard

For performance reasons, segments are stored only using their “effective extent” i.e. the region that contains non-background voxels. Considering this, the values seem correct. You can potentially extend the segments to the whole image (without checking, I think the best way to do this is to export the segmentation to labelmap using the image as a reference - in the Segmentations module - and then importing it back to segmentation). However, I find Dice a quite vague metric, due to its many dependencies (one is what you just demonstrated, another is that it works very differently with round shapes vs elongated shapes), and I recommend using Hausdorff instead.

Hello and thank you for your kind answer.
Could you please elaborate about non-background voxels? Or point to a tutorial that I could learn from?
Actually the most important is: do the percentages TP, FN and FP here correspond to true overlap of the 2 segments or segmentation without overlap for FN and FP. I want to evaluate segmentation of one user against another one here. So I’m in 2D. I’ll use the software later for 3D analysis and training.

I’m aware of the limitations of DSC. Thank you for the suggestion I’ll add the Hausdorff distance indeed. In nerve segmentation, clinicians (I’m one of them, an anesthesiolgist) like to stick with easy to understand metrics, so DSC is used a lot (a well as IoU). Even precision and recall are hardly mentionned.

Thank you very much in any case.

This is the paper about the segmentations infrastructure. A few things have changed since then, for example now we use layers for more efficient storage of segments, but the gist is the same.

Pinter, C., Lasso, A., & Fichtinger, G. (2019). Polymorph segmentation representation for medical image computing. Computer Methods and Programs in Biomedicine, 171, 19–26. Redirecting

I’m quite surprised, as I don’t think DSC is easier to understand than mean/max mm distances, and with elongated structures such as nerves it is borderline useless (1mm difference with a 1mm diameter structure results in a score of 0).

Thank you very much for your kind help and for your advice.
The nerves we study are bigger nerve (5-15 mm diameter). But yes DSC has many problems, including not distinguishing between FP and FN.
I will definitely use also Hausdorf distance in my work too thanks to your knowledgable advice.