Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement option to export proofreading as segmentation #8286

Open
wants to merge 7 commits into
base: master
Choose a base branch
from

Conversation

cdfhalle
Copy link

@cdfhalle cdfhalle commented Dec 16, 2024

Steps to test:

  • In a freshly created Annotation (with agglomerate file mapping) use the proofreading tool and merge the volume annotation with its fallback layer (by clicking onto the three dots next to the desired segmentation layer)
  • In the resulting Dataset the changes should be applied
  • Other ways of materializing volume annotations should still work as before and could also be tested

corresponding voxelytics pr


  • Updated changelog
  • Removed dev-only changes like prints and application.conf edits
  • Considered common edge cases
  • Needs datastore update after deployment

@cdfhalle cdfhalle linked an issue Dec 16, 2024 that may be closed by this pull request
Copy link
Contributor

coderabbitai bot commented Dec 16, 2024

📝 Walkthrough
📝 Walkthrough

Walkthrough

This pull request introduces several enhancements to WEBKNOSSOS, focusing on job processing, annotation export, and user interface improvements. The changes primarily revolve around adding the ability to export proofreading as segmentation, updating job-related API calls, and refining bounding box handling. The modifications span multiple components, including backend controllers, frontend API interfaces, and job processing modals, with a particular emphasis on providing more flexible options for volume annotation jobs.

Changes

File Change Summary
CHANGELOG.unreleased.md Updated to include new features and changes
app/controllers/JobController.scala Added includesProofreading and selectedBoundingBox parameters to runMaterializeVolumeAnnotationJob method
conf/webknossos.latest.routes Updated route for materialization job to include new optional parameters
frontend/javascripts/admin/api/jobs.ts Modified job-related API functions to support new parameters for segmentation annotation jobs
frontend/javascripts/oxalis/view/action-bar/starting_job_modals.tsx Added includesProofreading variable and updated job API call handling

Possibly related PRs

Suggested labels

enhancement

Suggested reviewers

  • MichaelBuessemeyer
  • normanrz

Poem

🐰 Hop, hop, through code so bright,
Bounding boxes take their flight,
Proofreading exports with glee,
Annotations dancing free!
A rabbit's code, precise and neat,
Makes WEBKNOSSOS a coding treat! 🚀


Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

  • Review comments: Directly reply to a review comment made by CodeRabbit. Example:
    • I pushed a fix in commit <commit_id>, please review it.
    • Generate unit testing code for this file.
    • Open a follow-up GitHub issue for this discussion.
  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR. (Beta)
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (7)
CHANGELOG.unreleased.md (1)

16-16: Enhance the changelog entry with more user-facing details.

While the entry correctly documents the new feature, consider expanding it to include usage instructions and requirements. Based on the PR description, you could enhance it like this:

-Added the option to export proofreading as segmentation [#8286](https://github.com/scalableminds/webknossos/pull/8286)
+Added the option to export proofreading as segmentation. Users can now materialize proofreading changes by selecting the three dots next to a segmentation layer in annotations with agglomerate file mapping. The changes will be reflected in the resulting Dataset. [#8286](https://github.com/scalableminds/webknossos/pull/8286)
frontend/javascripts/admin/api/jobs.ts (1)

211-212: Consider adding type validation for the bounding box parameter

The implementation correctly handles the new parameters, but consider adding validation for the selectedBoundingBox parameter to ensure it's a valid Vector6 before converting to string.

 selectedBoundingBox?: Vector6,
): Promise<APIJob> {
+  if (selectedBoundingBox && selectedBoundingBox.length !== 6) {
+    throw new Error('Invalid bounding box: must be a Vector6');
+  }

Also applies to: 227-232

app/controllers/JobController.scala (1)

398-401: Consider adding size limit validation for the bounding box

While the code correctly validates the bounding box for non-super users, consider adding size limit validation to prevent processing extremely large volumes.

def validateBoundingBoxSize(bbox: String): Fox[Unit] = {
  for {
    parsed <- BoundingBox.fromLiteral(bbox).toFox
    _ <- bool2Fox(parsed.width * parsed.height * parsed.depth <= maxVolumeSize) ?~> "boundingBox.tooLarge"
  } yield ()
}
frontend/javascripts/oxalis/view/action-bar/starting_job_modals.tsx (3)

894-894: Consider using useState for better state management

Instead of using a let variable, consider using React's useState hook for managing the includesProofreading state.

-  let includesProofreading = false;
+  const [includesProofreading, setIncludesProofreading] = useState(false);
   const activeSegmentationTracingLayer = useSelector(getActiveSegmentationTracingLayer);
   // ...
   if (tracing.volumes.length === 0) {
     // ...
   } else {
-    includesProofreading = tracing.volumes.some((v) => v.hasEditableMapping === true);
+    useEffect(() => {
+      setIncludesProofreading(tracing.volumes.some((v) => v.hasEditableMapping === true));
+    }, [tracing.volumes]);
   }

Also applies to: 929-930


Line range hint 962-991: Consider adding type safety for the bounding box computation

While the implementation is correct, consider adding type safety for the bounding box computation.

-          const bbox = selectedBoundingBox?.boundingBox
-            ? computeArrayFromBoundingBox(selectedBoundingBox.boundingBox)
-            : undefined;
+          const bbox = selectedBoundingBox?.boundingBox
+            ? ((): Vector6 => {
+                const computed = computeArrayFromBoundingBox(selectedBoundingBox.boundingBox);
+                if (computed.length !== 6) {
+                  throw new Error('Invalid bounding box: must be a Vector6');
+                }
+                return computed as Vector6;
+              })()
+            : undefined;

Line range hint 894-991: Consider splitting the component for better maintainability

The MaterializeVolumeAnnotationModal component has multiple responsibilities. Consider:

  1. Extracting the bounding box logic into a separate hook
  2. Creating a separate component for the job configuration
  3. Adding proper error boundaries

This will improve maintainability and make the code easier to test.

conf/webknossos.latest.routes (1)

271-271: LGTM! Consider adding parameter documentation.

The route change correctly implements the new parameters for proofreading export functionality. The parameter ordering follows good practices with required parameters first and optional ones last.

Consider adding comments to document:

  • The format expected for selectedBoundingBox parameter
  • The impact of includesProofreading on the materialization process
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a75c6a4 and a935314.

📒 Files selected for processing (5)
  • CHANGELOG.unreleased.md (1 hunks)
  • app/controllers/JobController.scala (3 hunks)
  • conf/webknossos.latest.routes (1 hunks)
  • frontend/javascripts/admin/api/jobs.ts (4 hunks)
  • frontend/javascripts/oxalis/view/action-bar/starting_job_modals.tsx (5 hunks)
🔇 Additional comments (4)
frontend/javascripts/admin/api/jobs.ts (1)

246-247: LGTM: Parameter propagation is correct

The new parameters are correctly propagated from startMaterializingVolumeAnnotationJob to startSegmentationAnnotationDependentJob.

Also applies to: 258-259

app/controllers/JobController.scala (2)

382-384: LGTM: Method signature changes are well-documented

The new parameters are properly added with appropriate types.


412-414: LGTM: Command arguments are properly updated

The new parameters are correctly added to the command arguments JSON object.

frontend/javascripts/oxalis/view/action-bar/starting_job_modals.tsx (1)

960-960: LGTM: Bounding box configuration is correctly tied to proofreading state

The bounding box configuration is properly controlled by the includesProofreading flag.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (1)
app/controllers/JobController.scala (1)

412-414: Handle optional parameters in commandArgs appropriately

When adding volumeLayerName, includesProofreading, and selectedBoundingBoxParsed to commandArgs, ensure that optional parameters are correctly serialized. Directly inserting Option types into a JSON object may result in null values, which might not be handled as expected downstream.

Consider modifying the commandArgs construction to include optional fields only when they are defined:

val baseCommandArgs = Json.obj(
  "organization_id" -> organization._id,
  "dataset_name" -> dataset.name,
  "dataset_directory_name" -> dataset.directoryName,
  "fallback_layer_name" -> fallbackLayerName,
  "annotation_id" -> annotationId,
  "output_segmentation_layer_name" -> outputSegmentationLayerName,
  "annotation_type" -> annotationType,
  "new_dataset_name" -> newDatasetName,
  "merge_segments" -> mergeSegments,
  "includes_proofreading" -> includesProofreading
)

val commandArgs = baseCommandArgs ++
  volumeLayerName.map("volume_layer_name" -> Json.toJson(_)) ++
  selectedBoundingBoxParsed.map("selected_bounding_box" -> Json.toJson(_))

This approach adds optional fields to commandArgs only when they are defined, avoiding potential null values in the JSON object.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a935314 and 70b5ef0.

📒 Files selected for processing (1)
  • app/controllers/JobController.scala (3 hunks)
🔇 Additional comments (2)
app/controllers/JobController.scala (2)

398-401: Confirm proper handling of selectedBoundingBoxParsed

The code conditionally parses selectedBoundingBox based on includesProofreading and checks bounding box limits for non-super users. Ensure that selectedBoundingBoxParsed is correctly handled when includesProofreading is false to avoid potential issues with uninitialized variables or unintended behavior.


382-384: Ensure all calls to runMaterializeVolumeAnnotationJob are updated

The method signature of runMaterializeVolumeAnnotationJob has been updated to include includesProofreading and selectedBoundingBox parameters. Please verify that all invocations of this method elsewhere in the codebase have been updated to match the new signature to prevent runtime errors.

Run the following script to find all usages of runMaterializeVolumeAnnotationJob and check for proper parameter usage:

@@ -13,6 +13,7 @@ For upgrade instructions, please check the [migration guide](MIGRATIONS.released
### Added
- Added the total volume of a dataset to a tooltip in the dataset info tab. [#8229](https://github.com/scalableminds/webknossos/pull/8229)
- Optimized performance of data loading with “fill value“ chunks. [#8271](https://github.com/scalableminds/webknossos/pull/8271)
- Added the option to export proofreading as segmentation [#8286](https://github.com/scalableminds/webknossos/pull/8286)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
- Added the option to export proofreading as segmentation [#8286](https://github.com/scalableminds/webknossos/pull/8286)
- Added the option to export a segmentation that was corrected with the proofreading tool to a new segmentation. [#8286](https://github.com/scalableminds/webknossos/pull/8286)

@@ -925,6 +926,8 @@ export function MaterializeVolumeAnnotationModal({
output dataset and the output segmentation layer.
</p>
);
} else {
includesProofreading = tracing.volumes.some((v) => v.hasEditableMapping === true);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If my annotation contains two volume layers and only one has an editable mapping, that property will be true. However, i I want to export the layer that has no editable mapping, true will still be used in this UI and sent to the backend, which seems wrong?

@@ -954,8 +957,13 @@ export function MaterializeVolumeAnnotationModal({
jobName={"materialize_volume_annotation"}
suggestedDatasetSuffix="with_merged_segmentation"
chooseSegmentationLayer
isBoundingBoxConfigurable={includesProofreading}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

only proofread layers support configuring the bbox? would it be easy to always support it when exporting volume annotations? no prio though.

@@ -208,6 +208,8 @@ function startSegmentationAnnotationDependentJob(
annotationId: string,
annotationType: APIAnnotationType,
mergeSegments?: boolean,
includesProofreading?: boolean,
selectedBoundingBox?: Vector6,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"selected" sounds as if this variable would refer to a bounding box (ID) selected in the UI. I'd rename it to simply boundingBox.

requestURL.searchParams.append("includesProofreading", includesProofreading.toString());
}
if (selectedBoundingBox) {
requestURL.searchParams.append("selectedBoundingBox", selectedBoundingBox.toString());
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
requestURL.searchParams.append("selectedBoundingBox", selectedBoundingBox.toString());
requestURL.searchParams.append("selectedBoundingBox", selectedBoundingBox.join(","));

should be identical, but in your variant one has to know how JS serializes number arrays. I find it better to be explicit about it.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (1)
app/controllers/JobController.scala (1)

398-400: LGTM! Validation logic is secure and well-structured.

The validation appropriately restricts non-super users from processing unbounded areas when proofreading is included. Consider extracting the condition to a named value for improved readability.

 multiUser <- multiUserDAO.findOne(request.identity._multiUser)
+val requiresBoundingBoxValidation = !multiUser.isSuperUser && includesProofreading
-_ <- Fox.runIf(!multiUser.isSuperUser && includesProofreading)(Fox.runOptional(selectedBoundingBox)(bbox =>
+_ <- Fox.runIf(requiresBoundingBoxValidation)(Fox.runOptional(selectedBoundingBox)(bbox =>
   jobService.assertBoundingBoxLimits(bbox, None)))
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 70b5ef0 and 721fc70.

📒 Files selected for processing (1)
  • app/controllers/JobController.scala (3 hunks)
🔇 Additional comments (3)
app/controllers/JobController.scala (3)

382-384: LGTM! Method signature changes are well-structured.

The new parameters are appropriately typed and named, aligning with the PR's objective to support proofreading export functionality.


411-413: LGTM! Command arguments are properly structured.

The new fields are correctly added to the command arguments, maintaining consistency with the existing JSON structure and naming conventions.


Line range hint 382-413: Verify integration with job processing system.

Let's verify the usage of these new parameters across the codebase to ensure proper integration.

✅ Verification successful

Based on the search results, I can now generate the final response as we have comprehensive information about the integration:

Integration with job processing system is properly implemented.

The changes are well-integrated across the codebase:

  • The job command materialize_volume_annotation is properly registered in JobCommand.scala
  • Route definition in webknossos.latest.routes correctly includes all new parameters
  • Frontend components in starting_job_modals.tsx handle the new parameters
  • Proper error messages are defined in conf/messages
  • Job processing infrastructure (workers) supports this command as shown in dbtool.js
  • Security checks are in place for organization-level access and bounding box limits for non-super users
🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Check for references to materializeVolumeAnnotation job
echo "Checking for job references..."
rg -A 5 "materializeVolumeAnnotation|materialize_volume_annotation"

# Check for related test coverage
echo "Checking for related tests..."
fd -e scala -e java -e py test | xargs rg -l "materializeVolumeAnnotation|materialize_volume_annotation"

# Check for route definitions
echo "Checking route definitions..."
fd routes | xargs rg "materializeVolumeAnnotation"

Length of output: 12019

Copy link
Member

@fm3 fm3 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I took the liberty to add a commit for the backend to make the linter happy, and remove a no-op line (no bbox was parsed there).

Backend LGTM now :)

Testing went well! 🎉

One thing: in the backend code, we usually speak of “editable mapping” rather than “proofreading”. @philippotto can you comment how this is handled in the frontend? Maybe we can unify the naming for the new job parameters as well.

@philippotto
Copy link
Member

@philippotto can you comment how this is handled in the frontend?

yes, that's also how it's done in the front-end.

@fm3
Copy link
Member

fm3 commented Dec 18, 2024

Alternatively, it might make sense to call the parameter use_zarr_streaming. and set this one to true if the frontend has an editable mapping for this layer. This way we don’t have to adapt the worker if the logic will change eventually. Long-term, we might be able to switch more than just proofreading annotations to the zarr streaming 🤔

@philippotto
Copy link
Member

Alternatively, it might make sense to call the parameter use_zarr_streaming. and set this one to true if the frontend has an editable mapping for this layer. This way we don’t have to adapt the worker if the logic will change eventually. Long-term, we might be able to switch more than just proofreading annotations to the zarr streaming 🤔

Yes, sounds good, too.

@philippotto philippotto changed the title implement option to export proofreading as segmentation Implement option to export proofreading as segmentation Dec 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Export editable mapping as segmentation
4 participants