May I ask whether multi-GPU inference is supported on Windows or Linux systems? Currently, apparently due to data size issues, MONAI Label seems to run inference in memory, i.e. on the CPU, without utilizing CUDA acceleration.
If I have multiple 4090 cards in one computer, how can I make them participate in parallel inference computation?
If you set up your CUDA environment correctly, then MONAI Label will use the GPU. If it is not using the GPU, that is a sign that torch is not detecting your GPU and driver for some reason, and you should follow PyTorch's troubleshooting steps.
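For example, a quick way to check whether torch actually sees your GPUs is to run something like this in the same Python environment the MONAI Label server uses:

```python
import torch

# Confirm that PyTorch was built with CUDA support and can see the driver.
print("CUDA available:", torch.cuda.is_available())
print("GPU count:", torch.cuda.device_count())

# List each detected device; every 4090 in the machine should show up here.
for i in range(torch.cuda.device_count()):
    print(f"cuda:{i} ->", torch.cuda.get_device_name(i))
```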
As for using multiple GPUs for inference, I don't think that's a common use case. Inference with a powerful GPU like a 4090 is very fast, so multiple GPUs shouldn't be needed.
The question was how to use multiple 4090 cards for inference with MONAI Label. The answer is not a simple one: depending on the application selected, MONAI Label can serve different models. Whether a model uses a single GPU or multiple GPUs depends on how each particular model was coded; it is not a configurable parameter of the MONAI Label serving application. So you would have to modify the code of the particular model(s) you are using in order to take advantage of multiple GPUs.

If you want to pursue this, I suggest directing further questions to the MONAI Label creators/maintainers at NVIDIA rather than this community.

I don't recommend spending time parallelizing inference execution. Instead, you could take advantage of multiple GPUs when training a new model, since that is a much more computationally expensive task.
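For illustration only, here is a minimal sketch of the kind of change that would have to live inside a model's own inference code, using plain PyTorch's `torch.nn.DataParallel`. None of the names below are MONAI Label API, and this is not something you can switch on from the server configuration:

```python
import torch
import torch.nn as nn

def wrap_for_multi_gpu(network: nn.Module) -> nn.Module:
    """Hypothetical helper: wrap a trained network so inference batches are
    split across all visible GPUs. This is plain PyTorch, not a MONAI Label
    setting; it would have to be added to the particular model's code."""
    device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
    network = network.to(device)
    if torch.cuda.device_count() > 1:
        # DataParallel replicates the model on every GPU and splits the input
        # along dimension 0, so it only helps when you infer on batches > 1.
        network = nn.DataParallel(network)
    return network

# Usage sketch (names are illustrative):
# network = wrap_for_multi_gpu(my_trained_network)
# with torch.no_grad():
#     output = network(batched_input.to("cuda"))
```

Note that `DataParallel` splits the input along the batch dimension, so it does nothing for single-volume, sliding-window inference; spreading the patches of one volume across GPUs would be a deeper change to the model's inferer. Multi-GPU training, by contrast, is a well-trodden path with `torch.distributed` / `DistributedDataParallel`, which is why extra GPUs pay off more there.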