Data Transforms API Flow#
This gives the rough flow of data outlining the data augmetnation procedure to train a SKOOTS model.
- class skoots.train.dataloader.dataset(path, transforms=<function dataset.<lambda>>, pad_size=100, device='cpu', sample_per_image=1)[source]
Custom dataset for loading and accessing skoots training data. This class loads data based on filenames and specific extensions: ‘.tif’ (raw image), ‘.labels.tif’ (instance masks), ‘.skeletons.tif’ (precomputed skeletons). An example training data folder might contain the following:
data\ └ train\ │ train_data.tif │ train_data.labels.tif └ train_data.skeletons.tif- Parameters:
path (
Union[List[str],str]) – Path to training datatransforms (
Optional[Callable[[Dict[str,Tensor]],Dict[str,Tensor]]]) – A function which applies dataset augmentation on a data_dictpad_size (
Optional[int]) – padding to add to every image in the datasetdevice (
Optional[str]) – torch.device which to output all data onsample_per_image (
Optional[int]) – number of times each image/mask pair is sampled per iteration over a dataset
- map(fn, key)[source]
applies a fn to an internal datastructure, provided by key. valid keys: [‘image’, ‘background’, ‘skele_masks’, ‘skeletons’]
- Return type:
- subtract_square_sum(other)[source]
returns the sum of the entire dataset, each px subtracted by other :type other: :param other: :return:
- to(device)[source]
Sends all data stored in the dataloader to a device.
- Parameters:
device (
str) – torch device for images, masks, and skeletons- Returns:
self
- class skoots.train.dataloader.MultiDataset(*args)[source]
A utility class for joining multiple datasets into one accessible class. Sometimes, you may subdivide your training data based on some criteria. The most common is size: data from folder data/train/train_alot must be sampled 100 times per epoch, while data from folder data/train/train_notsomuch might only want to be sampled 1 times per epoch.
You could construct a two skoots.train.dataloader.dataset objects for each and access both in a single MultiDataset class…
>>> from skoots.train.dataloader import dataset >>> >>> # has one image sampled 100 times >>> data0 = dataset('data/train/train_alot', sample_per_image=100) >>> print(len(data0)) # 100 >>> >>> # has one image sampled once >>> data1 = dataset('data/train/train_notsomuch', sample_per_image=1) >>> print(len(data1)) # 1 >>> >>> merged_data = MultiDataset(data0, data1) >>> print(len(merged_data)) # 101, they've been merged!
- Parameters:
args –
- cpu()[source]
alias for self.to(‘cpu’)
- Return type:
- cuda()[source]
alias for self.to(‘cuda:0’)
- Return type:
- map(fn, key)[source]
- Return type:
- mean(with_invert=False)[source]
- numel(with_invert=False)[source]
- std(with_invert=False)[source]
- sum(with_invert=False)[source]
- to(device)[source]
Sends all data stored in the dataloader to a device. Occurs for ALL wrapped datasets.
- Parameters:
device (
str) – torch device for images, masks, and skeletons- Return type:
- Returns:
self
- skoots.lib.skeleton.skeleton_to_mask(skeletons, shape, device=None, radius=7, flank_radius=3)[source]
Converts a skeleton Dict to a skeleton mask which can simply be regressed against via Dice loss or similar…
- Shapes:
skeletons: [N, 3]
shape \((3)\)
returns: math:(1, X_{in}, Y_{in}, Z_{in})
- Parameters:
skeletons (
Dict[int,Tensor]) – Dict of skeletonsshape (
Tuple[int,int,int]) – Shape of final maskdevice (
Union[device,str,None]) – output device
- Return type:
Tensor- Returns:
Mask of embedded skeleton px
- skoots.lib.skeleton.bake_skeleton(masks, skeletons, anisotropy=(1.0, 1.0, 1.0), average=True, device='cpu', return_distance=False)[source]
For each pixel \(p_ik\) of object \(k\) at index \(i\in[x,y,z]\) in masks, returns a baked skeleton where the value at each index is the closest skeleton point \(s_{jk}\) of any instance \(k\).
This should reflect the ACTUAL spatial distance of your dataset for best results…These models tend to like XY embedding vectors more than Z. For anisotropic datasets, you should roughly provide the anisotropic correction factor of each voxel. For instance anisotropy of (1.0, 1.0, 5.0) means that the Z dimension is 5x larger than XY.
Formally, the value at each position \(i\in[x,y,z]\) of the baked skeleton tensor \(S\) is the minimum of the euclidean distance function \(f(a, b)\) and the skeleton point of any instance:
\[S_{i} = min \left( f(i, s_{k})\right)\ for\ k \in [1, 2, ..., N]\]- Shapes:
masks: \((1, X_{in}, Y_{in}, Z_{in})\)
skeletons: \((3, N_i)\)
anisotropy: \((3)\)
returns: \((3, X_{in}, Y_{in}, Z_{in})\)
- Parameters:
masks (
Tensor) – Ground Truth instance mask of shape [1, X, Y, Z] of objects where each pixel is an integer id value.skeletons (
Dict[int,Tensor]) – Dict of skeleton indicies where each key is a unique instance of an object in mask. - Each skeleton has a shape [3, N] where N is the number of pixels constituting the skeletonanisotropy (
Tuple[float,float,float]) – Anisotropic correction factor for min distance calculationaverage (
bool) – Average the skeletons such that there is a smooth transition form one area to the nextdevice (
str) – torch.Device by which to run calculationsreturn_distance (
bool) – if true and bake_skeletons is dispatching the triton kernel, returns the distance to each closest skeleton
- Return type:
Union[Tensor,Tuple[Tensor,Tensor]]- Returns:
Baked skeleton