Why is the number of voxels different from the number of matrices?

Umm… I don’t understand why not necessary.

I want to auto annotation software using deep learning. So i labeling in image’s some part.
if i make a deep learning model that time input file is patient’s nrrd file. reasult is annotation on image.
So i think to combine two files works necessary