How to anonymize a scene

lassoan · October 20, 2025, 3:55pm

@chir.set started a discussion in the Slicer issue tracker on github, but it is worth giving more visibility to this to the community:

Please consider this workflow:

load a DICOM volume with patient information

save the scene, as MRB or not

close the scene

close Slicer

start Slicer

load the scene

select the patient in the Data module

show Subject Hierarchy information

Although there are no DICOM files in the saved scene but an NRRD file as scalar volume, DICOM tags are still in the *.mrml file, with PatientName and PatientBirthDate in particular.

image739×1071 81.9 KB

If the patient is removed from the DICOM database, the result is the same.

The problem: when an MRB file is shared, there is patient data leakage. Actually, that’s how I came across this finding, with the last shared MRB file on the Discourse forum. For this reason, I won’t link that post here but can do it in private messaging on request.

The scene file should probably not store values of DICOM tags, at least not containing the above mentioned ones. There’s nothing DICOM related in a saved scene.

Of course, a user may fully anonymise his series before sharing. It just gets cumbersome and most users won’t be doing that.

Thank you for considering.

Response from @pieper:

Since the MRML scene is extensible, we can’t assure that it is free of PHI, but I agree that this is a case where it would be reasonable to assume that no PHI is included.

@cpinter it looks like this can in via the subject hierarchy. Do you have ideas about how this could be automatically removed in this kind of scenario?

lassoan · October 20, 2025, 4:04pm

For many workflows it is essential to be able to go back to the DICOM database - for fetching additional metadata or be able to export results to DICOM. In research workflows, usually the data of the patient cohort is anonymized before you load any data into Slicer. In clinical workflows, usually all patient information preserved in the data set to make sure that you work on the correct patient and to keep the connection to the patient records.

There is a niche but important use case: you get data of an individual patient and then you want to share with people who are not authorized to see PHI.

Slicer supports a very simple, blunt tool for this: save the scene and then load the NRRD file (or other data files). This cuts all ties to the original DICOM, so it is hard to find you way back to the original data if you need it. PHI may still remain in the data via burnt-in annotations (e.g., patient name in the corner of an image in a secondary capture), recognizable face in a 3D scan, etc.
There are several DICOM anonymization tools that can do deidentification in a much more sophisticated way (cleans out PHI more reliably while preserving more data and offering controlled way to go map the data to the original source), which can be used outside Slicer.
I expect that in the not too distant future (within 1-2 years) there will be Slicer extensions for more convenient DICOM data deidentification, because machine learning is very data-hungry and Slicer will need to offer tools for making training data set building more convenient. These tools should work not just on cohorts but on individual patient data sets, too. They would not start from a Slicer scene (because it is not possible to anonymize data once it is read into the scene and accessed by arbitrary modules), but would work by processing the data before is loaded into the Slicer scene.

chir.set · October 20, 2025, 4:42pm

I came up with this brute force solution:

# ----------------------------------------------------------------------------
def anonymiseSubjectHierarchyFrom(startItemId):
  shNode = slicer.vtkMRMLSubjectHierarchyNode.GetSubjectHierarchyNode(slicer.mrmlScene)
  startItemChildrenIds = vtk.vtkIdList()
  shNode.GetItemChildren(startItemId, startItemChildrenIds)

  for i in range(startItemChildrenIds.GetNumberOfIds()):
    startItemNextChildId = startItemChildrenIds.GetId(i)
    if (shNode.GetItemLevel(startItemNextChildId) != "Patient"):
      continue
    if (shNode.HasItemAttribute(startItemNextChildId, "DICOM.PatientName")):
      shNode.SetItemAttribute(startItemNextChildId, "DICOM.PatientName", "Anonymous")
    if (shNode.HasItemAttribute(startItemNextChildId, "DICOM.PatientBirthDate")):
      shNode.SetItemAttribute(startItemNextChildId, "DICOM.PatientBirthDate", "11111111")
    
    anonymiseSubjectHierarchyFrom(startItemNextChildId)

# ----------------------------------------------------------------------------
def anonymiseScene():
  shNode = slicer.vtkMRMLSubjectHierarchyNode.GetSubjectHierarchyNode(slicer.mrmlScene)
  sceneItemId = shNode.GetSceneItemID()
  anonymiseSubjectHierarchyFrom(sceneItemId)

Two suggestions:

Add an option to anonymise the subject hierarchy in the Save dialog.
Add an option in the DICOM module to load a series as anonymous.

pieper · October 20, 2025, 4:44pm

I like this option. It’s clean and sounds more broadly useful.

lassoan · October 20, 2025, 5:28pm

DICOM loading plugins require the data to be in the DICOM database. Therefore, it is not possible to anonymize during DICOM loading without major rework or the DICOM plugins or switching to a temporary database.

It would be much simpler and cleaner to anonymize during import (could be activated in the “Import DICOM files” window) and/or during export (could be activated in “Export to files” window). It is also necessary to be able to do batch processing - anonymizing all data in a folder. All these require integration of an open-source DICOM anonymizer tool with a simple GUI for configuring and running the processing.

pieper · October 20, 2025, 5:59pm

Anonymize on import also makes sense. But what I took @chir.set to be suggesting would be to give the plugins the option of not putting PHI into the mrml scene when loading. Often it’s hard for a generic deidentifier to know what the tags mean, while the plugins are specific to the type of data and probably have a better chance of knowing which contain PHI and generating deidentified variants.

chir.set · October 20, 2025, 6:09pm

Yes, that’s what I meant. The DICOM database can have patient information since it is not a shared object.

lassoan · October 21, 2025, 12:50pm

Anonymize on import also makes sense. But what I took @chir.set to be suggesting would be to give the plugins the option of not putting PHI into the mrml scene when loading.

I agree that this would work for some workflows. However, it would break some modules.

The problem: There are a number of modules in several extensions (for example in SlicerHeart, SlicerCIP, probably also in SlicerRT, QuantitativeReporting and others) that would break if the DICOM instance UIDs were changed or skipped when loading the DICOM files. The instance UIDs are used for looking up additional DICOM fields in the database that are needed for certain operations. The DICOM plugin developers don’t know what additional metadata various modules in other extension will need.

Proposed solution: If data was anonymized during DICOM import or export (and not during loading) then there would be always valid database entries associated with the loaded data. Therefore, all the modules would still work well, while there was no PHI in the scene or in the exported files. These solutions also have the advantage that DICOM plugin developers would not need to know how to do DICOM anonymization properly, which is actually a very complex topic.

A relatively easy immediate solution (until more convenient solution is offered within Slicer) could be to anonymize the DICOM files before loading into Slicer. There are several tools already, for example:

Their feature set varies. Some of them have GUI. Most people would find that one of these work for them with little or no change in anonymization parameters.

chir.set · October 21, 2025, 7:27pm

I have included the functions above in an online module for easy use in the GUI, until a built-in solution appears.

Topic		Replies	Views
How to anonymize DICOM images? Support dicom	12	11480	December 20, 2023
How can i change the patient information in DICOM files using 3D Slicer during import? Support	3	448	April 17, 2024
How to hide patient name Support dicom	9	874	September 27, 2021
Is DICOM to NIFTI a fully anonymised file in Slicer? Support	6	2210	June 15, 2023
Who uses DICOM data bundle? Support	11	1025	June 12, 2017

How to anonymize a scene

Related topics