Implementation of Probabilistic U-Net. #46
base: main
Conversation
@@ -67,53 +70,38 @@
     "length_3D": [128, 128, 128],
     "stride_3D": [64, 64, 64],
     "attention": true,
-    "n_filters": 8
+    "n_filters": 16
I would actually be very interested to know how much difference this makes; we should definitely do an experiment on this!
Sure, definitely! From my experience, increasing the number of feature maps/filters helps improve performance, so I try to fit as many as possible into memory.
"early_stopping_epsilon": 0.001 | ||
}, | ||
"scheduler": { | ||
"initial_lr": 5e-05, | ||
"initial_lr": 2e-04, |
These might be old experiments and highly dependent on the specific hyperparameters (e.g. batch size), but in my experience with the challenge, lowering the learning rate was very helpful. I have seen cases where lowering the learning rate from `5e-05` to `3e-05` made a huge difference!
I second this!
Thanks for this! I really didn't know that lowering the learning rate could make that much of a difference. I am struggling to get PU-Net working for this dataset, so maybe this will help to some extent! :)
Looks beautiful 😍, and thanks for introducing us to PU-Net! I know we are focusing on more critical stuff now, but I truly think PU-Net addresses a problem/scenario that is not addressed by other models. Looking forward to a PU-Net with amazing results, and I hope my comments can help along the way!
I just want to make a note of the comment I made in your `modeling/datasets.py`: this is the only bug that I could spot, and I'm so sorry for introducing it in the first place 🙏!
assert len(ses01_subvolumes) == len(ses02_subvolumes) == len(gts_subvolumes[0])

for i in range(len(ses01_subvolumes)):
    subvolumes_ = {
(Please ignore this review if it doesn't apply!)
I remember doing a similar thing (i.e. reading all GTs from the experts), and it completely blew up the (CPU) memory, limiting my experiments severely! This was in the context of multi-GPU training though, so it might not apply here. If this is ever a problem, we could also move the GT-reading part to `__getitem__()` so that it's called for one subvolume at a time. It would slow things down considerably, but I just wanted to mention it as an alternative 🙂.
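If this ever becomes a problem, here is a minimal sketch of what I mean (the class name and the `gt_paths` attribute are hypothetical, just to frame the idea; the real `datasets.py` will look different):

```python
import random

import nibabel as nib
import torch
from torch.utils.data import Dataset


class MSSegDataset(Dataset):  # hypothetical minimal frame, just to place the snippet
    def __getitem__(self, index):
        # Load a single expert's GT lazily here, instead of caching all GTs in __init__.
        # `self.gt_paths[index]` is assumed to hold the 4 experts' GT file paths for
        # this subvolume (hypothetical attribute; adapt to the real datasets.py).
        gt_path = random.choice(self.gt_paths[index])
        gt = torch.from_numpy(nib.load(gt_path).get_fdata()).float()
        # ... load the ses01/ses02 subvolumes and preprocess as before ...
        return gt
```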
in_dim = self.in_channels if i == 0 else out_dim
out_dim = num_feat_maps[i]
if i != 0:
    layers.append(nn.AvgPool3d(kernel_size=3, stride=2, padding=0, ceil_mode=True))
I most likely don't know what I am talking about, but I am used to seeing `MaxPool3d` here instead of the current average-pooling layer. The smoothing effect that average-pooling introduces might cause errors at the edges of the segmentation. Please feel free to ignore this if it is what the paper suggests!
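Concretely, the swap I have in mind would be something like this (a sketch only, keeping your current kernel/stride/ceil_mode settings; whether it actually helps is an empirical question):

```python
import torch.nn as nn

layers = []  # the encoder layers being assembled, as in the surrounding code

# Current downsampling layer:
# layers.append(nn.AvgPool3d(kernel_size=3, stride=2, padding=0, ceil_mode=True))

# Possible alternative with max-pooling (same spatial behaviour, no smoothing):
layers.append(nn.MaxPool3d(kernel_size=3, stride=2, padding=0, ceil_mode=True))
```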
From my experience with GANs, MaxPooling abruptly eliminates certain elements, which can adversely affect learning, hence AveragePooling is used. However, I am carrying this over from GANs and I don't know if the same holds for segmentation models. Also, I thought that since U-Net has an upsampling path, it might help NOT to abruptly lose voxels through the MaxPool op. I don't know if that makes sense.
I guess you don't directly lose voxels per se, but I do see what you're saying! Also, `ivadomed.models.Modified3DUNet` seems not to use any pooling layers, so that might also be something to experiment with if applicable. Of course, sometimes we don't have any choice though.
layers.append(nn.AvgPool3d(kernel_size=3, stride=2, padding=0, ceil_mode=True))

layers.append(nn.Conv3d(in_channels=in_dim, out_channels=out_dim, kernel_size=3, padding=int(padding)))
layers.append(nn.ReLU(inplace=True))
Looking at `ModifiedUNet3D` and other segmentation models, there seems to be a preference for `LeakyReLU` over `ReLU`. The way `LeakyReLU` was introduced in the field poses a hard-to-dispute hypothesis: it retains the properties of `ReLU` while also preventing "dead" neurons. However, I understand that in practice this might not hold, and discussions over individual DL components are hardly ever productive. Quite frankly, I just like bringing these up because it is fun 😄! (Also, in some rare cases these small changes could help!)
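For concreteness, the swap would look something like this (just a sketch; the negative slope below is PyTorch's default, not a value taken from `ModifiedUNet3D`):

```python
import torch.nn as nn

layers = []  # the encoder layers being assembled, as in the surrounding code

# Current activation:
# layers.append(nn.ReLU(inplace=True))

# Possible alternative that keeps a small gradient for negative inputs:
layers.append(nn.LeakyReLU(negative_slope=0.01, inplace=True))
```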
encoding = self.encoder(inp)
self.show_enc = encoding
# for getting the mean of the resulting volume --> (batch-size x Channels x Depth x Height x Width)
encoding = torch.mean(encoding, dim=2, keepdim=True)
Two comments:
- Can you describe what lines 96-98 achieve? It seems like this could be done in a single line as well!
- Also, to someone like me who has never heard of `AxisAlignedConvGaussian`, this part seems like a huge bottleneck. My experience is that taking the mean of features (or aggregating them in any other way) inside a neural network always ends up being a bottleneck. Instead, modifying hyperparameters (e.g. kernel size, number of filters, etc.) so that the encoder outputs features in the desired shape works better. Again, keep in mind that I have not read the paper and am not very knowledgeable about PU-Net 🙂 (that is to say, feel free to ignore).
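On the first point, if the three consecutive means are each over one spatial axis (which is my assumption here), a single call should be equivalent:

```python
import torch

# Dummy encoder output with shape (B x C x D x H x W)
encoding = torch.randn(2, 32, 16, 16, 16)

# Equivalent to averaging over dims 2, 3 and 4 one at a time:
encoding = torch.mean(encoding, dim=(2, 3, 4), keepdim=True)  # (B x C x 1 x 1 x 1)
```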
# Squeeze all the singleton dimensions
mu_log_sigma = torch.squeeze(mu_log_sigma)  # shape: (B x (2*latent_dim))

mu = mu_log_sigma[:, :self.latent_dim]  # take the first "latent_dim" samples as mu
This part is very interesting! The way I have been implementing VAEs / convolutional VAEs is to always have two separate layers for extracting the `mu` and the `logvar` in parallel. (Also, a quick note: should it be called `log_var` instead of `log_sigma`, since `log_sigma` would mean log standard deviation instead?)
Now, I can't answer the following question: why can't we achieve the same thing with a single layer, as you do here? (You probably can!) But I would still make sure that this is indeed how the authors implement, or suggest implementing, this part!
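For reference, this is roughly how I've been doing it in my VAEs; the channel sizes below are made up, and I'm not suggesting this is how the PU-Net authors do it:

```python
import torch
import torch.nn as nn

latent_dim, enc_channels = 6, 32  # illustrative values only

# Two parallel 1x1x1 convs on the pooled encoder features, instead of one conv
# producing 2 * latent_dim channels that gets split into mu / log_sigma afterwards.
mu_layer = nn.Conv3d(enc_channels, latent_dim, kernel_size=1)
log_var_layer = nn.Conv3d(enc_channels, latent_dim, kernel_size=1)

encoding = torch.randn(2, enc_channels, 1, 1, 1)        # pooled features (B x C x 1 x 1 x 1)
mu = mu_layer(encoding).flatten(start_dim=1)            # (B x latent_dim)
log_var = log_var_layer(encoding).flatten(start_dim=1)  # (B x latent_dim)
```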
# self.reconstruction_loss = torch.sum(reconstruction_loss)
# self.mean_reconstruction_loss = torch.mean(reconstruction_loss)

# # TODO: use DiceLoss as the criterion instead? --> Uncomment below lines
Here I would actually vote for `DiceLoss`, and the only reason is that Dice is a metric we know for sure will be used in the test phase of the challenge! I would try to compare the validation performance (SOFT and HARD Dice scores) you get when training with `BCEWithLogitsLoss()` vs. `DiceLoss()`. Or better yet, we can now compare ANIMA metrics (e.g. F1 score) as well 😉!
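In case it helps for a quick comparison, here is a generic soft Dice loss sketch (not `ivadomed`'s exact implementation, just the usual formulation on sigmoid probabilities):

```python
import torch


def soft_dice_loss(logits: torch.Tensor, target: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """1 - soft Dice, computed on sigmoid probabilities over the whole batch."""
    probs = torch.sigmoid(logits)
    intersection = (probs * target).sum()
    denominator = probs.sum() + target.sum()
    return 1.0 - (2.0 * intersection + eps) / (denominator + eps)
```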
for _ in range(num_predictions):
    mask_pred = model.sample(testing=True)
    # TODO: this line below gets hard predictions. Use just sigmoid and see how Dice performs.
    mask_pred = torch.sigmoid(mask_pred)  # getting a soft pred.; shape: (B x 1 x P x P x P)
I know we have discussed this so many times, but something else to try is removing the sigmoid and using a normalized ReLU (i.e. ReLU but in the 0-1 range) instead. This is what gave the best results for me!
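To spell out what I mean by normalized ReLU (a sketch; per-sample max-normalization is one way to squash the outputs into the 0-1 range, there are other variants):

```python
import torch
import torch.nn.functional as F


def normalized_relu(x: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """ReLU followed by per-sample division by the max, so outputs lie in [0, 1]."""
    x = F.relu(x)
    # Max over all non-batch dimensions, reshaped so it broadcasts back over x.
    max_per_sample = x.flatten(start_dim=1).max(dim=1).values.view(-1, *([1] * (x.dim() - 1)))
    return x / (max_per_sample + eps)
```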
    # NOTE: This will also help with discarding empty inputs!
-   if ses01_patches.std() < 1e-5 or ses02_patches.std() < 1e-5:
+   if ses01_subvolumes.std() < 1e-5 or ses02_subvolumes.std() < 1e-5:
        if self.train:
            return self.__getitem__(random.randint(0, self.__len__() - 1))
I noticed this very late, but this line actually causes information leakage: during training it can fetch a validation index, and during validation it can fetch a training index. Really sorry about this! In the new version of `datasets.py`, which you can check in #44 (the `um/transunet_setup` branch), I repeat what I do for validation (as seen in the `else` statement) for training as well.
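As a rough illustration of one way to avoid crossing the split boundary (the `split_indices` attribute and the `_load_subvolumes` helper are hypothetical; the actual fix in #44 instead handles this case the same way the validation branch does):

```python
import random

from torch.utils.data import Dataset


class MSSegDataset(Dataset):  # hypothetical minimal frame, just to place the snippet
    def __getitem__(self, index):
        ses01_subvolumes, ses02_subvolumes = self._load_subvolumes(index)  # hypothetical helper
        if ses01_subvolumes.std() < 1e-5 or ses02_subvolumes.std() < 1e-5:
            if self.train:
                # Resample only from indices belonging to the *training* split,
                # instead of random.randint(0, len(self) - 1) over the whole dataset.
                # `self.split_indices` is a hypothetical attribute; see #44 for the real fix.
                return self.__getitem__(random.choice(self.split_indices))
        ...
```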
The primary contribution of this PR is the implementation of Probabilistic U-Net (in 3D) for the MSSeg challenge. Architectural hints and suggestions have been taken from the paper's appendix. Here is a summary of the major changes:

1. The `datasets.py` file was modified to load subvolumes of size `128x128x128` directly and, importantly, this version does away with the consensus GT (for the time being) and instead loads one of the 4 experts' segmentations at random during training.
2. `unet.py` and `probabilistic_unet.py` introduce the PU-Net architecture. As mentioned in the paper, 3 networks are involved during training: the prior net, the posterior net, and the standard U-Net.
3. `utils.py` just adds some basic utility functions for initializing the PU-Net model correctly.

Review is primarily required for points 1 and 2 (`punet.py` in particular), just to ensure that the code is logical. Suggestions and areas for improvement are welcome!
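For reviewers less familiar with PU-Net, here is a rough sketch of how the three networks come together in the training loss. It follows the loss structure described in the paper; the function and variable names (and the beta value) are illustrative and do not match this PR's actual API:

```python
import torch
import torch.nn.functional as F
from torch.distributions import Normal, kl_divergence


def punet_training_loss(unet_logits, prior_mu, prior_log_sigma,
                        post_mu, post_log_sigma, target, beta=10.0):
    """ELBO-style loss: reconstruction term + beta-weighted KL(posterior || prior).

    `unet_logits` are the U-Net outputs after injecting a latent sample drawn from
    the posterior net; the prior/posterior statistics come from the two
    AxisAlignedConvGaussian encoders. Names and the beta value are illustrative.
    """
    prior = Normal(prior_mu, torch.exp(prior_log_sigma))
    posterior = Normal(post_mu, torch.exp(post_log_sigma))
    kl = kl_divergence(posterior, prior).sum(dim=1).mean()
    recon = F.binary_cross_entropy_with_logits(unet_logits, target)
    return recon + beta * kl
```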
Things To-Do:

- `BCEWithLogitsLoss`: Try using `ivadomed`'s readily available DiceLoss to see how the training fares.

Note: The branch is inappropriately named `run_baselines`, which is what it originally set out to be, but it then evolved completely into PU-Net.