Re-train all PlantSeg models to eliminate hallucination during inference #224

Open
qin-yu opened this issue Apr 17, 2024 · 0 comments
Labels: BioImage.IO (Related to BioImage.IO and AI4Life), bug (Something isn't working), enhancement (New feature or request)


Historically, we have worked around the tiling artifact by using larger patch sizes. @Buglakova recently raised the problem again in Issue #190, which prompted further investigation in both the PlantSeg and pytorch-3dunet projects. See the related PRs: #220 for PlantSeg and #113 for pytorch-3dunet.
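For readers unfamiliar with the tiling artifact, here is a minimal 1D sketch of why predictions near patch borders can differ from a whole-image prediction, and how a halo (extra context that is predicted and then cropped) removes the mismatch. This is a toy illustration only, not the PlantSeg or pytorch-3dunet implementation: `predict_tiled` and `box_blur` are hypothetical names, and the 3-tap blur merely stands in for a network with a receptive field radius of 1.

```python
def predict_tiled(signal, model, patch, halo):
    """Run `model` over `signal` in tiles, keeping only each tile's core region."""
    n = len(signal)
    out = []
    for start in range(0, n, patch):
        stop = min(start + patch, n)
        # Extend the tile by `halo` on both sides, clipped to the signal bounds.
        lo = max(start - halo, 0)
        hi = min(stop + halo, n)
        pred = model(signal[lo:hi])
        # Crop the halo so that only the core [start, stop) is stitched.
        offset = start - lo
        out.extend(pred[offset:offset + (stop - start)])
    return out


def box_blur(xs):
    """Toy stand-in for a network: 3-tap mean with edge replication."""
    padded = [xs[0]] + list(xs) + [xs[-1]]
    return [(padded[i] + padded[i + 1] + padded[i + 2]) / 3 for i in range(len(xs))]


signal = [float(i * i % 7) for i in range(20)]
full = box_blur(signal)                                   # reference: no tiling
with_halo = predict_tiled(signal, box_blur, patch=5, halo=1)
no_halo = predict_tiled(signal, box_blur, patch=5, halo=0)

# Positions where the un-haloed tiled prediction deviates from the reference.
mismatches = [i for i in range(len(signal)) if no_halo[i] != full[i]]
print("halo matches reference:", with_halo == full)
print("mismatching positions without halo:", mismatches)
```

In this toy the halo only needs to cover the receptive field radius. For a real 3D U-Net the required halo depends on the architecture, which is presumably why larger patches alone only reduce the artifact rather than remove it.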

Findings:
After adjusting the halo implementation, it is clear that nuclear models trained with batch normalization (using the default track_running_stats=True) do not hallucinate predictions on new datasets, whereas models trained with group normalization do.
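One plausible mechanism behind this difference (an assumption, not something verified in this issue) is that group norm always normalizes with the statistics of the input it sees, while batch norm in eval mode with tracked running statistics normalizes with fixed values estimated during training. On a patch that contains no signal, per-input normalization stretches faint noise to unit scale. The sketch below shows only that arithmetic in plain Python. The function names and the synthetic patches are hypothetical, and learned affine parameters, channels, and groups are ignored.

```python
import random
import statistics

EPS = 1e-5


def group_norm_like(patch):
    """Normalize with the patch's own mean and variance (per-input statistics)."""
    mean = statistics.fmean(patch)
    var = statistics.pvariance(patch)
    return [(x - mean) / (var + EPS) ** 0.5 for x in patch]


def batch_norm_eval(patch, running_mean, running_var):
    """Normalize with fixed statistics carried over from training (eval mode)."""
    return [(x - running_mean) / (running_var + EPS) ** 0.5 for x in patch]


rng = random.Random(0)

# Hypothetical training-like patch: faint background plus sparse bright "nuclei".
train_patch = [rng.gauss(0.0, 0.01) + (1.0 if i % 10 == 0 else 0.0) for i in range(1000)]
running_mean = statistics.fmean(train_patch)
running_var = statistics.pvariance(train_patch)

# Hypothetical out-of-distribution patch: no signal, only faint noise.
empty_patch = [rng.gauss(0.0, 0.01) for _ in range(1000)]

gn_out = group_norm_like(empty_patch)
bn_out = batch_norm_eval(empty_patch, running_mean, running_var)

# Spread of the normalized activations: noise is amplified only in the first case.
gn_spread = statistics.pstdev(gn_out)
bn_spread = statistics.pstdev(bn_out)
print(f"per-input statistics: spread = {gn_spread:.3f}")
print(f"running statistics:   spread = {bn_spread:.3f}")
```

With per-input statistics the empty patch ends up with roughly unit spread, so downstream layers see activations on the same scale as real structure. With running statistics the noise stays small relative to what the model saw during training.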

Previous Assumptions:
We assumed that group norm performed better under certain conditions, based on earlier hyperparameter tuning with datasets from similar distributions. This assumption needs re-evaluation, since batch norm has proven more stable across diverse datasets.

Action Required:
I propose re-training and releasing updated versions of all models previously trained with group norm. This is critical to ensure consistency and reliability in our semantic segmentation tasks.

Visual Evidence:

  • Left: Prediction from a model trained with group norm on an ovules nuclear dataset.
  • Middle: Raw image of a mouse embryo (note: brightness is increased to highlight the absence of signal).
  • Right: Prediction from a model trained with batch norm on an ovules nuclear dataset.

[Image: Comparison of model predictions]

@wolny, could you assist in identifying the training datasets used for each model to facilitate these updates? I'll do the training after my CBB Seminar talk tomorrow.

qin-yu added the bug, enhancement, and BioImage.IO labels on Apr 17, 2024
qin-yu self-assigned this on Apr 17, 2024