What are the correct metadata attributes when adding a zarr array?

I have an image layer with prechunked data loaded with x, y, z dimensions at 4, 4, 40 nm voxel resolution. I want to add a zarr volume that is stored in z, y, x orientation with 40, 8, 8 nm voxel resolution. After adding it, I get the following source info:

[Screenshot from 2021-07-22 16-20-39: source info after adding the zarr volume]

where the zarr array has the following .zattrs:

{
    "_ARRAY_DIMENSIONS": ["z", "y", "x"],
    "offset": [
        0,
        0,
        0
    ],
    "resolution": [
        40,
        8,
        8
    ],
    "scale": 255
}

and the following .zarray:

{
    "chunks": [
        15,
        304,
        304
    ],
    "compressor": {
        "id": "gzip",
        "level": 5
    },
    "dtype": "|u1",
    "fill_value": 0,
    "filters": null,
    "order": "C",
    "shape": [
        7063,
        67072,
        124416
    ],
    "zarr_format": 2
}

What is the correct way to do this? Perhaps I can add the resolution and bounds information directly to the metadata files.
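
To make the mismatch between the two layers concrete, here is a minimal sketch that reads the metadata quoted above and derives the per-axis voxel size and physical extent in x, y, z order. It assumes that `resolution` is given in nanometres and is listed in the same order as `_ARRAY_DIMENSIONS`. That matches the description in the question, but it is not something this thread establishes as a convention Neuroglancer reads. The helper name `per_axis` is made up for illustration.

```python
import json

# Metadata copied from the question (.zattrs and the relevant part of .zarray).
zattrs = json.loads("""
{"_ARRAY_DIMENSIONS": ["z", "y", "x"],
 "offset": [0, 0, 0],
 "resolution": [40, 8, 8],
 "scale": 255}
""")
zarray = json.loads('{"shape": [7063, 67072, 124416], "chunks": [15, 304, 304]}')


def per_axis(zattrs, zarray):
    """Map each axis name to its resolution, size and physical extent.

    Assumption: "resolution" is in nm and ordered like "_ARRAY_DIMENSIONS".
    """
    names = zattrs["_ARRAY_DIMENSIONS"]
    out = {}
    for name, res, size in zip(names, zattrs["resolution"], zarray["shape"]):
        out[name] = {
            "resolution_nm": res,
            "size_voxels": size,
            "extent_nm": res * size,
        }
    return out


axes = per_axis(zattrs, zarray)

# Reorder from the stored z, y, x order to the x, y, z order of the image layer.
xyz_resolution = [axes[a]["resolution_nm"] for a in "xyz"]
xyz_extent = [axes[a]["extent_nm"] for a in "xyz"]

# Voxel-size ratio of the zarr volume relative to the 4, 4, 40 nm image layer.
image_resolution = {"x": 4, "y": 4, "z": 40}
ratio = {a: axes[a]["resolution_nm"] / image_resolution[a] for a in "xyz"}

print(xyz_resolution)  # [8, 8, 40]
print(xyz_extent)      # [995328, 536576, 282520]
print(ratio)           # {'x': 2.0, 'y': 2.0, 'z': 1.0}
```

Under these assumptions, one zarr voxel covers two image-layer voxels along x and y and one along z, so the two layers line up only if the viewer is told both the axis order and the per-axis scale.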

Issue Analytics

Created 2 years ago
Comments: 14 (5 by maintainers)

Top GitHub Comments

3 reactions

d-v-b commented, Sep 8, 2021

Yes, we are making a constant effort to un-bake the magic dimension names from the spec 😃

1 reaction

unidesigner commented, Sep 10, 2021

Thanks @d-v-b for pointing out the discussion around ome/ngff. There seems to be a lot going on, and it is very nice that many tools already support ngff or intend to. I think it would make sense to contribute some of the discussion and proposals here towards a proposal for axes labeling in the ngff specification, and then add support for OME/ngff to Neuroglancer, instead of creating another special-purpose zarr metadata spec that is not supported by a wider community.

A few points for discussion may be:

- specification/support of structured data types
- overall structure of the JSON: more like the unit proposal from @jbms, or like @d-v-b's proposal with separate scale and units fields holding arrays
- the proposal by @jbms about using '' and null
- what format to use for units, e.g. using udunits2 to specify the unit strings
- I saw that there is a proposal to add a spec for sequences of spatial transforms, and @jbms made the point about possibly supporting affines for visualization purposes. I'd find it very useful to have a way to specify an affine for a dataset for visualization purposes, i.e. the affine is applied when rendering the image data in Neuroglancer. I was wondering whether the planned transform specification would be a suitable place to store and look up this affine, or whether it would be more closely related to the translate field you have above. There is also the problem, for arrays with non-spatial dimensions, of how to select the relevant axes to apply the affine to. I think there was some discussion about it in the ngff issue, but I haven't seen a conclusive solution yet.
- how a translate field (in world units) would impact indexing operations, e.g. in tensorstore. I haven't checked in detail, but is the idea to support indexing operations in world space/units vs. the index space of the array data itself?
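
The last point, index space versus world space, can be illustrated with a small sketch. It assumes a purely per-axis transform, `world = index * scale + translate`, with both `scale` and `translate` expressed in world units. The `AxisTransform` class and its method names are hypothetical and are not the API of tensorstore, Neuroglancer, or the ngff specification.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class AxisTransform:
    """Hypothetical per-axis scale and translate, both in world units (nm)."""

    scale: Sequence[float]
    translate: Sequence[float]

    def index_to_world(self, index: Sequence[int]) -> List[float]:
        # World position of the voxel's origin corner.
        return [i * s + t for i, s, t in zip(index, self.scale, self.translate)]

    def world_to_index(self, world: Sequence[float]) -> List[int]:
        # Floor division selects the voxel that contains the world position.
        return [
            int((w - t) // s)
            for w, s, t in zip(world, self.scale, self.translate)
        ]


# z, y, x order with the 40, 8, 8 nm resolution from the question.
plain = AxisTransform(scale=[40, 8, 8], translate=[0, 0, 0])
# Same array, but shifted by 400 nm along z.
shifted = AxisTransform(scale=[40, 8, 8], translate=[400, 0, 0])

world = plain.index_to_world([10, 100, 100])
print(world)                          # [400, 800, 800]
print(plain.world_to_index(world))    # [10, 100, 100]
print(shifted.world_to_index(world))  # [0, 100, 100]
```

In this sketch the same world position resolves to different array indices once a translate is present, which is why it matters whether an indexing operation is defined in world units or in the array's own index space.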
