mml.core.scripts.schedulers.preprocess_scheduler
- class PreprocessScheduler
AbstractBaseScheduler implementation for preprocessing data. Includes the following subroutine: preprocess.
- preprocess_and_store(index: int, ds: TaskDataset, target_base: Path, transform: AlbumentationsAugmentationModule) → Dict[str, bool]
Function to preprocess and store a single data tuple of a dataset.
- Parameters:
index – index of the data tuple to be preprocessed
ds – dataset containing the data tuple
target_base – root path under which the preprocessed data is stored
transform – an albumentations transform module to apply to the loaded data
- Returns:
dict mapping each modality key to a boolean that indicates whether the corresponding file already existed before the call
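The contract described above (load one data tuple, apply a transform, write one file per modality, and report which files were already present) can be illustrated with a minimal sketch. This is not the mml implementation: `preprocess_and_store_sketch`, the `Sample` type, the `<target_base>/<modality>/<index>.bin` layout, and the choice to leave existing files untouched are all assumptions made for illustration, and a plain sequence of dicts and a plain callable stand in for `TaskDataset` and `AlbumentationsAugmentationModule`.

```python
import tempfile
from pathlib import Path
from typing import Callable, Dict, Sequence, Tuple

# Hypothetical stand-in for a data tuple: modality key -> raw bytes.
Sample = Dict[str, bytes]


def preprocess_and_store_sketch(
    index: int,
    ds: Sequence[Sample],
    target_base: Path,
    transform: Callable[[Sample], Sample],
) -> Dict[str, bool]:
    """Preprocess sample `index` and write one file per modality.

    Returns a dict mapping each modality key to whether its target file
    already existed before this call.
    """
    sample = transform(ds[index])
    existed: Dict[str, bool] = {}
    for modality, payload in sample.items():
        # Assumed layout: <target_base>/<modality>/<index>.bin
        target = target_base / modality / f"{index:06d}.bin"
        existed[modality] = target.exists()
        if not existed[modality]:
            # Assumption: files that already exist are not overwritten.
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(payload)
    return existed


def demo() -> Tuple[Dict[str, bool], Dict[str, bool]]:
    """Run the sketch twice on the same index in a temporary directory."""
    ds = [{"image": b"\x00\x01", "mask": b"\x01"}]
    with tempfile.TemporaryDirectory() as tmp:
        first = preprocess_and_store_sketch(0, ds, Path(tmp), lambda s: s)
        second = preprocess_and_store_sketch(0, ds, Path(tmp), lambda s: s)
    return first, second
```

In this sketch, the first call reports `False` for every modality because nothing has been written yet, and a second call with the same index reports `True`, which is one way a caller could detect work that has already been done.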