virtex.data.datasets
Downstream Datasets
- class virtex.data.datasets.downstream.ImageNetDataset(data_root: str = 'datasets/imagenet', split: str = 'train', image_transform: Callable = Compose([SmallestMaxSize(always_apply=False, p=1.0, max_size=256, interpolation=1), CenterSquareCrop(always_apply=False, p=1.0, height=224, width=224), Normalize(always_apply=False, p=1.0, mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225), max_pixel_value=255.0)], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}))[source]
Bases:
torchvision.datasets.imagenet.ImageNet
Simple wrapper over torchvision’s ImageNet dataset. Image transform is handled here instead of passing to super class.
- Parameters
data_root – Path to the ImageNet dataset directory.
split – Which split to read from. One of
{"train", "val"}
.image_transform –
List of image transformations, from either albumentations or
virtex.data.transforms
.
- class virtex.data.datasets.downstream.INaturalist2018Dataset(data_root: str = 'datasets/inaturalist', split: str = 'train', image_transform: Callable = Compose([SmallestMaxSize(always_apply=False, p=1.0, max_size=256, interpolation=1), CenterSquareCrop(always_apply=False, p=1.0, height=224, width=224), Normalize(always_apply=False, p=1.0, mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225), max_pixel_value=255.0)], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}))[source]
Bases:
torch.utils.data.dataset.Dataset
A dataset which provides image-label pairs from the iNaturalist 2018 dataset.
- Parameters
data_root – Path to the iNaturalist 2018 dataset directory.
split – Which split to read from. One of
{"train", "val"}
.image_transform –
List of image transformations, from either albumentations or
virtex.data.transforms
.
- class virtex.data.datasets.downstream.VOC07ClassificationDataset(data_root: str = 'datasets/VOC2007', split: str = 'trainval', image_transform: Callable = Compose([SmallestMaxSize(always_apply=False, p=1.0, max_size=256, interpolation=1), CenterSquareCrop(always_apply=False, p=1.0, height=224, width=224), Normalize(always_apply=False, p=1.0, mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225), max_pixel_value=255.0)], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}))[source]
Bases:
torch.utils.data.dataset.Dataset
A dataset which provides image-label pairs from the PASCAL VOC 2007 dataset.
- Parameters
data_root – Path to VOC 2007 directory containing sub-directories named
Annotations
,ImageSets
, andJPEGImages
.split – Which split to read from. One of
{"trainval", "test"}
.image_transform –
List of image transformations, from either albumentations or
virtex.data.transforms
.
- class virtex.data.datasets.downstream.ImageDirectoryDataset(data_root: str, image_transform: Callable = Compose([SmallestMaxSize(always_apply=False, p=1.0, max_size=256, interpolation=1), CenterSquareCrop(always_apply=False, p=1.0, height=224, width=224), Normalize(always_apply=False, p=1.0, mean=(0.485, 0.456, 0.406), std=(0.229, 0.224, 0.225), max_pixel_value=255.0)], p=1.0, bbox_params=None, keypoint_params=None, additional_targets={}))[source]
Bases:
torch.utils.data.dataset.Dataset
A dataset which reads images from any directory. This class is useful to run image captioning inference on our models with any arbitrary images.
- Parameters
data_root – Path to a directory containing images.
image_transform –
List of image transformations, from either albumentations or
virtex.data.transforms
.