From aa620faa1e9defda057d16078100652c67836898 Mon Sep 17 00:00:00 2001 From: David Stansby Date: Fri, 22 Nov 2024 12:08:41 +0100 Subject: [PATCH 1/4] Clarify that axes MUST be a list of dictionaries --- 0.5/index.bs | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/0.5/index.bs b/0.5/index.bs index 3f326ffa..23a55181 100644 --- a/0.5/index.bs +++ b/0.5/index.bs @@ -179,7 +179,7 @@ The OME-Zarr Metadata version MUST be consistent within a hierarchy. "axes" metadata {#axes-md} -------------------------- -"axes" describes the dimensions of a physical coordinate space. It is a list of dictionaries, where each dictionary describes a dimension (axis) and: +"axes" describes the dimensions of a physical coordinate space. It MUST be list of dictionaries, where each dictionary describes a dimension (axis) and: - MUST contain the field "name" that gives the name for this dimension. The values MUST be unique across all "name" fields. - SHOULD contain the field "type". It SHOULD be one of "space", "time" or "channel", but MAY take other string values for custom axis types that are not part of this specification yet. - SHOULD contain the field "unit" to specify the physical unit of this dimension. The value SHOULD be one of the following strings, which are valid units according to UDUNITS-2. From 01d1c5d652dbc6af4a52943ac5aa2e49c4c026a9 Mon Sep 17 00:00:00 2001 From: Josh Moore Date: Fri, 22 Nov 2024 13:43:35 +0100 Subject: [PATCH 2/4] Update 0.5/index.bs --- 0.5/index.bs | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/0.5/index.bs b/0.5/index.bs index 23a55181..76541715 100644 --- a/0.5/index.bs +++ b/0.5/index.bs @@ -179,7 +179,7 @@ The OME-Zarr Metadata version MUST be consistent within a hierarchy. "axes" metadata {#axes-md} -------------------------- -"axes" describes the dimensions of a physical coordinate space. It MUST be list of dictionaries, where each dictionary describes a dimension (axis) and: +"axes" describes the dimensions of a physical coordinate space. It MUST be a list of dictionaries, where each dictionary describes a dimension (axis) and: - MUST contain the field "name" that gives the name for this dimension. The values MUST be unique across all "name" fields. - SHOULD contain the field "type". It SHOULD be one of "space", "time" or "channel", but MAY take other string values for custom axis types that are not part of this specification yet. - SHOULD contain the field "unit" to specify the physical unit of this dimension. The value SHOULD be one of the following strings, which are valid units according to UDUNITS-2. From a8c650c5da9167981cb484510d8742ca568b9d2a Mon Sep 17 00:00:00 2001 From: David Stansby Date: Mon, 25 Nov 2024 08:34:30 +0000 Subject: [PATCH 3/4] Add changelog entry --- 0.5/index.bs | 83 ++++++++++++++++++++++++++++------------------------ 1 file changed, 45 insertions(+), 38 deletions(-) diff --git a/0.5/index.bs b/0.5/index.bs index 76541715..b799e84a 100644 --- a/0.5/index.bs +++ b/0.5/index.bs @@ -52,8 +52,8 @@ Storage format {#storage-format} OME-Zarr is implemented using the Zarr format as defined by the [version 3 of the Zarr specification](https://zarr-specs.readthedocs.io/en/latest/v3/core/v3.0.html). -All features of the Zarr format including codecs, chunk grids, chunk -key encodings, data types and storage transformers may be used with +All features of the Zarr format including codecs, chunk grids, chunk +key encodings, data types and storage transformers may be used with OME-Zarr unless explicitly disallowed in this specification. An overview of the layout of an OME-Zarr fileset should make @@ -87,7 +87,7 @@ Note that the number of dimensions is variable between 2 and 5 and that axis nam │ ├── zarr.json # All image arrays must be up to 5-dimensional │ │ # with the axis of type time before type channel, before spatial axes. │ │ - │ └─ ... # Chunks are stored conforming to the Zarr array specification and + │ └─ ... # Chunks are stored conforming to the Zarr array specification and │ # metadata as specified in the array's `zarr.json`. │ └── labels @@ -106,7 +106,7 @@ Note that the number of dimensions is variable between 2 and 5 and that axis nam │ ├── 0 # Each multiscale level is stored as a separate Zarr array, as above, but only integer values └── ... # are supported. - + @@ -143,7 +143,7 @@ A well group SHOULD NOT be present if there are no images in the well. │ │ ├── 0 # First field of view of well A1 │ │ │ │ │ │ │ ├── zarr.json # Implements "multiscales", "omero" - │ │ │ ├── 0 # Resolution levels + │ │ │ ├── 0 # Resolution levels │ │ │ ├── ... │ │ │ └── labels # Labels (optional) │ │ └── ... # Other fields of view @@ -154,11 +154,11 @@ A well group SHOULD NOT be present if there are no images in the well. OME-Zarr Metadata {#metadata} ============================= -The "OME-Zarr Metadata" contains metadata keys as specified below +The "OME-Zarr Metadata" contains metadata keys as specified below for discovering certain types of data, especially images. The OME-Zarr Metadata is stored in the various `zarr.json` files throughout the above array -hierarchy. In this file, the metadata is stored under the namespaced key +hierarchy. In this file, the metadata is stored under the namespaced key `ome` in `attributes`. The version of the OME-Zarr Metadata is denoted as a string in the `version` attribute within the `ome` namespace. @@ -293,8 +293,8 @@ The transformations in the list are applied sequentially and in order. "multiscales" metadata {#multiscale-md} --------------------------------------- -Metadata about an image can be found under the "multiscales" key in the group-level OME-Zarr Metadata. -Here, image refers to 2 to 5 dimensional data representing image or volumetric data with optional time or channel axes. +Metadata about an image can be found under the "multiscales" key in the group-level OME-Zarr Metadata. +Here, image refers to 2 to 5 dimensional data representing image or volumetric data with optional time or channel axes. It is stored in a multiple resolution representation. "multiscales" contains a list of dictionaries where each entry describes a multiscale image. @@ -384,18 +384,18 @@ for more information. "labels" metadata {#labels-md} ------------------------------ -In OME-Zarr, Zarr arrays representing pixel-annotation data are stored in a group called "labels". Some applications--notably image segmentation--produce -a new image that is in the same coordinate system as a corresponding multiscale image (usually having the same dimensions and coordinate transformations). -This new image is composed of integer values corresponding to certain labels with custom meanings. For example, pixels take the value 1 or 0 -if the corresponding pixel in the original image represents cellular space or intercellular space, respectively. -Such an image is referred to in this specification as a 'label image'. +In OME-Zarr, Zarr arrays representing pixel-annotation data are stored in a group called "labels". Some applications--notably image segmentation--produce +a new image that is in the same coordinate system as a corresponding multiscale image (usually having the same dimensions and coordinate transformations). +This new image is composed of integer values corresponding to certain labels with custom meanings. For example, pixels take the value 1 or 0 +if the corresponding pixel in the original image represents cellular space or intercellular space, respectively. +Such an image is referred to in this specification as a 'label image'. -The "labels" group is nested within an image group, at the same level of the Zarr hierarchy as the resolution levels for the original image. -The "labels" group is not itself an image; it contains images. The pixels of the label images MUST be integer data types, i.e. one of -[`uint8`, `int8`, `uint16`, `int16`, `uint32`, `int32`, `uint64`, `int64`]. Intermediate groups between "labels" and the images within it are allowed, +The "labels" group is nested within an image group, at the same level of the Zarr hierarchy as the resolution levels for the original image. +The "labels" group is not itself an image; it contains images. The pixels of the label images MUST be integer data types, i.e. one of +[`uint8`, `int8`, `uint16`, `int16`, `uint32`, `int32`, `uint64`, `int64`]. Intermediate groups between "labels" and the images within it are allowed, but these MUST NOT contain metadata. Names of the images in the "labels" group are arbitrary. -The OME-Zarr Metadata in the `zarr.json` file associated with the "labels" group MUST contain a JSON object with the key `labels`, whose value is a JSON array of paths to the +The OME-Zarr Metadata in the `zarr.json` file associated with the "labels" group MUST contain a JSON object with the key `labels`, whose value is a JSON array of paths to the labeled multiscale image(s). All label images SHOULD be listed within this metadata file. For example: ```json @@ -413,32 +413,32 @@ labeled multiscale image(s). All label images SHOULD be listed within this metad } ``` -The `zarr.json` file for the label image MUST implement the multiscales specification. Within the `multiscales` object, the JSON array -associated with the `datasets` key MUST have the same number of entries (scale levels) as the original unlabeled image. +The `zarr.json` file for the label image MUST implement the multiscales specification. Within the `multiscales` object, the JSON array +associated with the `datasets` key MUST have the same number of entries (scale levels) as the original unlabeled image. -In addition to the `multiscales` key, the OME-Zarr Metadata in this image-level `zarr.json` file SHOULD contain another key, `image-label`, -whose value is also a JSON object. The `image-label` object stores information about the display colors, source image, and optionally, -further arbitrary properties of the label image. That `image-label` object SHOULD contain the following keys: first, a `colors` key, -whose value MUST be a JSON array describing color information for the unique label values. Second, a `version` key, whose value MUST be a +In addition to the `multiscales` key, the OME-Zarr Metadata in this image-level `zarr.json` file SHOULD contain another key, `image-label`, +whose value is also a JSON object. The `image-label` object stores information about the display colors, source image, and optionally, +further arbitrary properties of the label image. That `image-label` object SHOULD contain the following keys: first, a `colors` key, +whose value MUST be a JSON array describing color information for the unique label values. Second, a `version` key, whose value MUST be a string specifying the version of the OME-Zarr `image-label` schema. -Conforming readers SHOULD display labels using the colors specified by the `colors` JSON array, as follows. This array contains one -JSON object for each unique custom label. Each of these objects MUST contain the `label-value` key, whose value MUST be the integer -corresponding to a particular label. In addition to the `label-value` key, the objects in this array MAY contain an `rgba` key whose -value MUST be an array of four integers between 0 and 255, inclusive. These integers represent the `uint8` values of red, green, and -blue that comprise the final color to be displayed at the pixels with this label. The fourth integer in the `rgba` array represents alpha, -or the opacity of the color. Additional keys under `colors` are allowed. +Conforming readers SHOULD display labels using the colors specified by the `colors` JSON array, as follows. This array contains one +JSON object for each unique custom label. Each of these objects MUST contain the `label-value` key, whose value MUST be the integer +corresponding to a particular label. In addition to the `label-value` key, the objects in this array MAY contain an `rgba` key whose +value MUST be an array of four integers between 0 and 255, inclusive. These integers represent the `uint8` values of red, green, and +blue that comprise the final color to be displayed at the pixels with this label. The fourth integer in the `rgba` array represents alpha, +or the opacity of the color. Additional keys under `colors` are allowed. Next, the `image-label` object MAY contain the following keys: a `properties` key, and a `source` key. -Like the `colors` key, the value of the `properties` key MUST be an array of JSON objects describing the set of unique possible pixel values. -Each object in the `properties` array MUST contain the `label-value` key, whose value again MUST be an integer specifying the pixel value for that label. -Additionally, an arbitrary number of key-value pairs MAY be present for each label value, denoting arbitrary metadata associated with that label. +Like the `colors` key, the value of the `properties` key MUST be an array of JSON objects describing the set of unique possible pixel values. +Each object in the `properties` array MUST contain the `label-value` key, whose value again MUST be an integer specifying the pixel value for that label. +Additionally, an arbitrary number of key-value pairs MAY be present for each label value, denoting arbitrary metadata associated with that label. Label-value objects within the `properties` array do not need to have the same keys. -The value of the `source` key MUST be a JSON object containing information about the original image from which the label image derives. -This object MAY include a key `image`, whose value MUST be a string specifying the relative path to a Zarr image group. -The default value is `../../` since most labeled images are stored in a "labels" group that is nested within the original image group. +The value of the `source` key MUST be a JSON object containing information about the original image from which the label image derives. +This object MAY include a key `image`, whose value MUST be a string specifying the relative path to a Zarr image group. +The default value is `../../` since most labeled images are stored in a "labels" group that is nested within the original image group. Here is an example of a simple `image-label` object for a label image in which 0s and 1s represent intercellular and cellular space, respectively: @@ -447,8 +447,8 @@ path: examples/label_strict/colors_properties.json highlight: json -In this case, the pixels consisting of a 0 in the Zarr array will be displayed as 50% blue and 50% opacity. Pixels with a 1 in the Zarr array, -which correspond to cellular space, will be displayed as 50% green and 50% opacity. +In this case, the pixels consisting of a 0 in the Zarr array will be displayed as 50% blue and 50% opacity. Pixels with a 1 in the Zarr array, +which correspond to cellular space, will be displayed as 50% green and 50% opacity. "plate" metadata {#plate-md} ---------------------------- @@ -565,6 +565,13 @@ Multi-word keys in this specification should use the `camelCase` style. NB: some parts of the specification don't obey this convention as they were added before this was adopted, but they should be updated in due course. +Changelog +========= + +0.6 +--- +- Clarified that "axes" MUST be a list of dictionaries in section 2.1. + Implementations {#implementations} ================================== From 6eaff3ee75bb566175db869497a65798f0199d24 Mon Sep 17 00:00:00 2001 From: David Stansby Date: Mon, 25 Nov 2024 08:55:32 +0000 Subject: [PATCH 4/4] Add some more clarifications --- 0.5/index.bs | 39 ++++++++++++++++++++++++++------------- 1 file changed, 26 insertions(+), 13 deletions(-) diff --git a/0.5/index.bs b/0.5/index.bs index b799e84a..5919cf74 100644 --- a/0.5/index.bs +++ b/0.5/index.bs @@ -158,11 +158,11 @@ The "OME-Zarr Metadata" contains metadata keys as specified below for discovering certain types of data, especially images. The OME-Zarr Metadata is stored in the various `zarr.json` files throughout the above array -hierarchy. In this file, the metadata is stored under the namespaced key +hierarchy. In this file, the metadata MUST be stored under the namespaced key `ome` in `attributes`. -The version of the OME-Zarr Metadata is denoted as a string in the `version` attribute within the `ome` namespace. +The version of the OME-Zarr Metadata MUST be denoted using a string in the `version` attribute within the `ome` namespace. -The OME-Zarr Metadata version MUST be consistent within a hierarchy. +The OME-Zarr Metadata version MUST be identical within all OME-Zarr metadata files hierarchy. ```json { @@ -182,7 +182,7 @@ The OME-Zarr Metadata version MUST be consistent within a hierarchy. "axes" describes the dimensions of a physical coordinate space. It MUST be a list of dictionaries, where each dictionary describes a dimension (axis) and: - MUST contain the field "name" that gives the name for this dimension. The values MUST be unique across all "name" fields. - SHOULD contain the field "type". It SHOULD be one of "space", "time" or "channel", but MAY take other string values for custom axis types that are not part of this specification yet. -- SHOULD contain the field "unit" to specify the physical unit of this dimension. The value SHOULD be one of the following strings, which are valid units according to UDUNITS-2. +- SHOULD contain the field "unit" to specify the physical unit of this dimension. The value MAY be an arbitrary object, but SHOULD be one of the following strings, which are valid units according to UDUNITS-2. - Units for "space" axes: 'angstrom', 'attometer', 'centimeter', 'decimeter', 'exameter', 'femtometer', 'foot', 'gigameter', 'hectometer', 'inch', 'kilometer', 'megameter', 'meter', 'micrometer', 'mile', 'millimeter', 'nanometer', 'parsec', 'petameter', 'picometer', 'terameter', 'yard', 'yoctometer', 'yottameter', 'zeptometer', 'zettameter' - Units for "time" axes: 'attosecond', 'centisecond', 'day', 'decisecond', 'exasecond', 'femtosecond', 'gigasecond', 'hectosecond', 'hour', 'kilosecond', 'megasecond', 'microsecond', 'millisecond', 'minute', 'nanosecond', 'petasecond', 'picosecond', 'second', 'terasecond', 'yoctosecond', 'yottasecond', 'zeptosecond', 'zettasecond' @@ -219,22 +219,22 @@ series.ome.zarr # One converted fileset from bioformats2raw

Attributes

-The OME-Zarr Metadata in the top-level `zarr.json` file must contain the `bioformats2raw.layout` key: +The OME-Zarr Metadata in the top-level `zarr.json` file MUST contain the `bioformats2raw.layout` key:
 path: examples/bf2raw/image.json
 highlight: json
 
If the top-level group represents a plate, the `bioformats2raw.layout` metadata will be present but -the "plate" key MUST also be present, takes precedence and parsing of such datasets should follow [[#plate-md]]. It is not -possible to mix collections of images with plates at present. +the "plate" key MUST also be present, takes precedence and parsing of such datasets should follow [[#plate-md]]. +`bioformats2raw` datasets that represent a plate MUST NOT contain any other images.
 path: examples/bf2raw/plate.json
 highlight: json
 
-The OME-Zarr Metadata in the `zarr.json` file within the OME group may contain the "series" key: +The OME-Zarr Metadata in the `zarr.json` file within the OME group MAY contain the "series" key:
 path: examples/ome/series-2.json
@@ -301,19 +301,23 @@ It is stored in a multiple resolution representation.
 
 Each "multiscales" dictionary MUST contain the field "axes", see [[#axes-md]].
 The length of "axes" must be between 2 and 5 and MUST be equal to the dimensionality of the zarr arrays storing the image data (see "datasets:path").
-The "axes" MUST contain 2 or 3 entries of "type:space" and MAY contain one additional entry of "type:time" and MAY contain one additional entry of "type:channel" or a null / custom type.
-The order of the entries MUST correspond to the order of dimensions of the zarr arrays. In addition, the entries MUST be ordered by "type" where the "time" axis must come first (if present), followed by the  "channel" or custom axis (if present) and the axes of type "space".
+"axes" MUST contain 2 or 3 entries of "type:space" and MAY contain one additional entry of "type:time" and MAY contain one additional entry of "type:channel" or a null / custom type.
+The order of the entries MUST correspond to the order of dimensions of the zarr arrays.
+In addition, the entries MUST be ordered by "type" where the "time" axis must come first (if present), followed by the  "channel" or custom axis (if present) and the axes of type "space".
 If there are three spatial axes where two correspond to the image plane ("yx") and images are stacked along the other (anisotropic) axis ("z"), the spatial axes SHOULD be ordered as "zyx".
 
 Each "multiscales" dictionary MUST contain the field "datasets", which is a list of dictionaries describing the arrays storing the individual resolution levels.
 Each dictionary in "datasets" MUST contain the field "path", whose value contains the path to the array for this resolution relative
 to the current zarr group. The "path"s MUST be ordered from largest (i.e. highest resolution) to smallest.
 
-Each "datasets" dictionary MUST have the same number of dimensions and MUST NOT have more than 5 dimensions. The number of dimensions and order MUST correspond to number and order of "axes".
+Each "datasets" dictionary MUST have the same number of dimensions and MUST NOT have more than 5 dimensions.
+The number of dimensions and order MUST correspond to number and order of "axes".
 Each dictionary in "datasets" MUST contain the field "coordinateTransformations", which contains a list of transformations that map the data coordinates to the physical coordinates (as specified by "axes") for this resolution level.
 The transformations are defined according to [[#trafo-md]]. The transformation MUST only be of type `translation` or `scale`.
-They MUST contain exactly one `scale` transformation that specifies the pixel size in physical units or time duration. If scaling information is not available or applicable for one of the axes, the value MUST express the scaling factor between the current resolution and the first resolution for the given axis, defaulting to 1.0 if there is no downsampling along the axis.
-It MAY contain exactly one `translation` that specifies the offset from the origin in physical units. If `translation` is given it MUST be listed after `scale` to ensure that it is given in physical coordinates.
+They MUST contain exactly one `scale` transformation that specifies the pixel size in physical units or time duration.
+If scaling information is not available or applicable for one of the axes, the value MUST express the scaling factor between the current resolution and the first resolution for the given axis, defaulting to 1.0 if there is no downsampling along the axis.
+It MAY contain exactly one `translation` that specifies the offset from the origin in physical units.
+If `translation` is given it MUST be listed after `scale` to ensure that it is given in physical coordinates.
 The length of the `scale` and `translation` array MUST be the same as the length of "axes".
 The requirements (only `scale` and `translation`, restrictions on order) are in place to provide a simple mapping from data coordinates to physical coordinates while being compatible with the general transformation spec.
 
@@ -571,6 +575,15 @@ Changelog
 0.6
 ---
 - Clarified that "axes" MUST be a list of dictionaries in section 2.1.
+- Clarified that OME-Zarr Metadata MUST be stored under the "ome" key in the
+  zarr attributes JSON in section 2.
+- Clarified that OME-Zarr version MUST be denoted using a string under the `version` attribute
+  in section 2.
+- Stipluate that the OME-Zarr metadata version must be "identical" (changed from "consistent")
+  within all OME-Zarr metadata files in a single hierarchy (section 2).
+- Clarified that units can be arbitrary objects (section 2.1)
+- Clarified that `bioformats2raw` datasets that represent a plate
+  must not contain other images (section 2.2).
 
 Implementations {#implementations}
 ==================================