Translated PolyMNIST Dataset
- class multivae.data.datasets.TranslatedMMNIST(path, scale, translate, n_modalities, background_path=None, split='train', transform=ToTensor(), target_transform=None)[source]
Translated version of the PolyMNIST dataset. The data is built from background images that need to be downloaded beforehand.
The original PolyMNIST (5 modalities) background images can be downloaded from : https://mybox.inria.fr/d/78e581ee5b07402983fa/.
To use the ExtendedPolyMNIST dataset (10 modalities) introduced in “Score-Based Multimodal Autoencoder” (Wesego 2024), download the background images from https://github.com/rooshenasgroup/sbmae/tree/main/poly_background.
>>> from multivae.data.datasets import TranslatedMMNIST >>> dataset = TranslatedMMNIST( ... path = 'your_data_path', ... scale = 0.75, # downscale 75% ... translate = True, # random translation ... n_modalities = 5, ... background_path = 'path_to_background_image' ...)
- Parameters:
path (str) – parent path where to save the dataset
scale (float) – The scale factor to downsample the MNIST images
translate (bool) – Wether to translate the MNIST images
n_modalities (int) – The number of modalities. It must match the number of background images.
background_path (str, optional) – Path to the background images. Defaults to None.
split (str, optional) – train or test. Defaults to ‘train’.
transform (torchvision.transforms, optional) – The transform to apply to images. Defaults to ToTensor().
target_transform (torchvision.transform, optional) – The transform to apply to labels. Defaults to None.