Examples

Logging

Import the get_logger function from zetta_utils.log and use it to obtain a logger object. Use this logger instead of print and warnings.warn statements.

>>> from zetta_utils.log import get_logger
>>> logger = get_logger("test")
>>> logger.warning("This is a warning")
>>> logger.info("Info message")
>>> try:
...    raise RuntimeError("Something went wrong")
... except RuntimeError:
...    logger.exception("Caught an exception")

Tensor Operations

Generic ops:

>>> from zetta_utils import tensor_ops
>>> import numpy as np
>>> a = np.ones((2, 2))
>>> a = tensor_ops.unsqueeze(a)
>>> print(a.shape)
(1, 2, 2)
>>> from zetta_utils import tensor_ops
>>> import torch
>>> t = torch.ones((2, 2))
>>> t = tensor_ops.unsqueeze(t)
>>> print(t.shape)
torch.Size([1, 2, 2])
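
These ops accept both NumPy arrays and torch tensors, returning the same type they were given. The inverse operation is available as well; a short sketch, assuming tensor_ops.squeeze mirrors unsqueeze:

>>> t = tensor_ops.squeeze(t)
>>> print(t.shape)
torch.Size([2, 2])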

Generic conversion:

>>> from zetta_utils import tensor_ops
>>> import numpy as np
>>> a = np.ones((2, 2))
>>> a = tensor_ops.convert.to_torch(a)
>>> print(type(a))
<class 'torch.Tensor'>
>>> from zetta_utils import tensor_ops
>>> import torch
>>> t = torch.ones((2, 2))
>>> t = tensor_ops.convert.to_torch(t)
>>> print(type(t))
<class 'torch.Tensor'>
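
Conversion in the other direction is analogous; a short sketch, assuming the symmetric tensor_ops.convert.to_np helper:

>>> a = tensor_ops.convert.to_np(t)
>>> print(type(a))
<class 'numpy.ndarray'>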

BBox3D

>>> from zetta_utils.geometry import BBox3D
>>> bbox = BBox3D.from_coords(
...    start_coord=(100, 100, 10),
...    end_coord=(200, 200, 20),
...    resolution=(4, 4, 40)
... )
>>> print(bbox)
BBox3D(bounds=((400, 800), (400, 800), (400, 800)), unit='nm', pprint_px_resolution=(1, 1, 1))
>>> slices = bbox.to_slices(resolution=(16, 16, 100))
>>> print(slices)
(slice(25, 50, None), slice(25, 50, None), slice(4, 8, None))
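
Bounds are stored in nanometers (pixel coordinates multiplied by the resolution), so converting back at the original resolution recovers the input coordinates:

>>> print(bbox.to_slices(resolution=(4, 4, 40)))
(slice(100, 200, None), slice(100, 200, None), slice(10, 20, None))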

Layers

Layers for CloudVolume IO:

>>> from zetta_utils.layer.volumetric.cloudvol import build_cv_layer
>>> from zetta_utils.geometry import Vec3D
>>> # Vanilla CloudVolume analog.
>>> # Differences from vanilla CV:
>>> #   1. Read data type: ``torch.Tensor``.
>>> #   2. Dimension order: CXYZ.
>>> cvl = build_cv_layer(
...    path="https://storage.googleapis.com/fafb_v15_aligned/v0/img/img_norm"
... )
>>> data = cvl[Vec3D(64, 64, 40), 7500:7564, 2250:2314, 2000:2001]
>>> data.shape # channel, x, y, z
torch.Size([1, 64, 64, 1])


>>> from zetta_utils.layer.volumetric.cloudvol import build_cv_layer
>>> from zetta_utils.geometry import Vec3D
>>> # Advanced features: custom index resolution, desired
>>> # (output) resolution, and data resolution. Data is read
>>> # at data_resolution and interpolated to
>>> # default_desired_resolution using interpolation_mode.
>>> cvl = build_cv_layer(
...    path="https://storage.googleapis.com/fafb_v15_aligned/v0/img/img_norm",
...    default_desired_resolution=Vec3D(64, 64, 40),
...    index_resolution=Vec3D(4, 4, 40),
...    data_resolution=Vec3D(128, 128, 40),
...    interpolation_mode="img",
... )
>>> data = cvl[120000:121024, 36000:37024, 2000:2001] # (4, 4, 40) indexing
>>> data.shape # channel, x, y, z
torch.Size([1, 64, 64, 1])

Layer sets for grouping layers together:

>>> from zetta_utils.geometry import Vec3D
>>> from zetta_utils.layer.volumetric.cloudvol import build_cv_layer
>>> from zetta_utils.layer import build_layer_set
>>> cvl_x0 = build_cv_layer(
...    path="https://storage.googleapis.com/fafb_v15_aligned/v0/img/img"
... )
>>> cvl_x1 = build_cv_layer(
...    path="https://storage.googleapis.com/fafb_v15_aligned/v0/img/img_norm"
... )
>>> # Combine the two layers
>>> lset = build_layer_set(
...    layers={"img": cvl_x0, "img_norm": cvl_x1}
... )
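
Indexing a layer set fans the index out to every member layer. Under that assumption (consistent with the dataset sample below), a read returns a dict keyed by layer name; a sketch:

>>> data = lset[Vec3D(64, 64, 40), 7500:7564, 2250:2314, 2000:2001]
>>> print(sorted(data.keys()))
['img', 'img_norm']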

Datasets

You can wrap any layer (including layer sets) as a PyTorch dataset. In this example, we will make a dataset out of the following layer set:

>>> from zetta_utils.layer.volumetric.cloudvol import build_cv_layer
>>> from zetta_utils.layer import build_layer_set
>>> lset = build_layer_set(layers={
...    'img': build_cv_layer(path="https://storage.googleapis.com/fafb_v15_aligned/v0/img/img"),
...    'img_norm': build_cv_layer(path="https://storage.googleapis.com/fafb_v15_aligned/v0/img/img_norm"),
... })

To form a layer dataset, we need to specify both the layer and a mapping from sample number to an index that the layer understands. Such a mapping, referred to as a sample indexer, determines which bounding cube is used to fetch training sample #0, #1, etc., as well as how many training samples there are in total. In this example, we will use VolumetricStridedIndexer (the chunk_id is unused in this example):

>>> from zetta_utils import training
>>> from zetta_utils.geometry import BBox3D, Vec3D
>>> from zetta_utils.layer.volumetric.cloudvol import build_cv_layer
>>> from zetta_utils.layer import build_layer_set
>>> indexer = training.datasets.sample_indexers.VolumetricStridedIndexer(
...    # Range over which to sample
...    bbox=BBox3D.from_coords(
...       start_coord=Vec3D(1000, 1000, 2000),
...       end_coord=Vec3D(2000, 2000, 2100),
...       resolution=Vec3D(64, 64, 40)
...    ),
...    # How big each chunk will be
...    chunk_size=Vec3D(128, 128, 1),
...    # Which resolution we want
...    resolution=Vec3D(64, 64, 40),
...    # How close together samples can be
...    stride=Vec3D(32, 32, 1),
...    # What to do if `bbox` doesn't divide evenly
...    mode="shrink",
... )
>>> print(len(indexer)) # total number of samples
78400
>>> print(indexer(0))
VolumetricIndex(resolution=Vec3D(64, 64, 40), bbox=BBox3D(bounds=((64000.0, 72192.0), (64000.0, 72192.0), (80000.0, 80040.0)), unit='nm', pprint_px_resolution=(1, 1, 1)), chunk_id=0, allow_slice_rounding=False)
>>> print(indexer(1))
VolumetricIndex(resolution=Vec3D(64, 64, 40), bbox=BBox3D(bounds=((66048.0, 74240.0), (64000.0, 72192.0), (80000.0, 80040.0)), unit='nm', pprint_px_resolution=(1, 1, 1)), chunk_id=0, allow_slice_rounding=False)
>>> print(indexer(78399))
VolumetricIndex(resolution=Vec3D(64, 64, 40), bbox=BBox3D(bounds=((119296.0, 127488.0), (119296.0, 127488.0), (83960.0, 84000.0)), unit='nm', pprint_px_resolution=(1, 1, 1)), chunk_id=0, allow_slice_rounding=False)
>>> dset = training.datasets.LayerDataset(
...    layer=lset,
...    sample_indexer=indexer,
... )
>>> sample = dset[0]
>>> print(list(sample.keys()))
['img', 'img_norm']
>>> print(sample['img'].shape)
torch.Size([1, 128, 128, 1])
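
Since LayerDataset is an ordinary PyTorch dataset, it can be fed to a standard torch DataLoader; samples are dicts of tensors, which the default collate function stacks per key. A minimal sketch:

>>> from torch.utils.data import DataLoader
>>> loader = DataLoader(dset, batch_size=4)
>>> batch = next(iter(loader))
>>> print(batch['img'].shape)
torch.Size([4, 1, 128, 128, 1])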

Builder

builder provides machinery to represent layers, datasets, or any other registered components as dictionaries. This can be used to pass flexible parameters to CLI tools and to allow flexible, readable specifications of training and inference workflows through JSON/YAML/CUE files.

To make objects of a class buildable with builder:

>>> from zetta_utils import builder
>>> @builder.register("SomeClass")
... class SomeClass:
...    def __init__(self, a):
...       self.a = a

After a type is registered, you can represent its objects as dictionaries by including the matching @type key and providing the initialization parameters:

>>> spec = {
...    "@type": "SomeClass",
...    "a": 100
... }
>>> obj = builder.build(spec)
>>> print(type(obj))
<class 'SomeClass'>
>>> print(obj.a)
100

All user-facing zetta_utils objects are registered with builder on module import. Don't forget to import all zetta_utils modules that you want the builder to know about. You can check the state of the current registry by inspecting builder.REGISTRY.
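
For example, after registering SomeClass above, and assuming REGISTRY behaves like a mapping keyed by registered names:

>>> from zetta_utils import builder
>>> "SomeClass" in builder.REGISTRY
True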

builder builds your objects recursively, which means you can specify complex structures, such as the dataset from the earlier example:

>>> from zetta_utils import builder
>>> from zetta_utils import training
>>> spec = {
...    "@type": "LayerDataset",
...    "layer": {
...       "@type": "build_layer_set",
...       "layers": {
...          "img": {"@type": "build_cv_layer", "path": "https://storage.googleapis.com/fafb_v15_aligned/v0/img/img"},
...          "img_norm": {"@type": "build_cv_layer", "path": "https://storage.googleapis.com/fafb_v15_aligned/v0/img/img_norm"}
...       }
...    },
...    "sample_indexer": {
...        "@type": "VolumetricStridedIndexer",
...        "bbox": {
...           "@type": "BBox3D.from_coords",
...           "start_coord": [1000, 1000, 2000],
...           "end_coord": [2000, 2000, 2100],
...           "resolution": [64, 64, 40],
...        },
...        "resolution": [64, 64, 40],
...        "chunk_size": [128, 128, 1],
...        "stride": [32, 32, 1],
...        "mode": "shrink",
...    }
... }
>>> dset = builder.build(spec)
>>> sample = dset[0]
>>> print(list(sample.keys()))
['img', 'img_norm']
>>> print(sample['img'].shape)
torch.Size([1, 128, 128, 1])
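
Since specs are plain dictionaries, the same structure can come straight from a JSON, YAML, or CUE file instead of a Python literal. A minimal sketch using json with the SomeClass registered above:

>>> import json
>>> spec = json.loads('{"@type": "SomeClass", "a": 42}')
>>> builder.build(spec).a
42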