trainer

The trainer config group sets the flags for the lightning.Trainer used by mml. The full documentation can be found here.

default_trainer

_target_

default: lightning.Trainer

the trainer is instantiated by the scheduler through create_trainer()
any keyword argument accepted by Trainer can be provided as trainer.KWARG=VALUE
callbacks are configured separately through the callbacks config group (see there for details on checkpointing)
the experiment logger is configured through the logging config group
see Trainer in the lightning documentation for all available flags
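
The notes above can be illustrated with a small, standard-library-only sketch of how trainer.KWARG=VALUE overrides end up as keyword arguments. It is not mml's implementation: mml relies on its configuration framework for parsing, and the helper names apply_overrides and trainer_kwargs are hypothetical. The default values are copied from this page.

```python
import ast

# Defaults as listed on this page; _target_ names the class to instantiate.
DEFAULT_TRAINER = {
    "_target_": "lightning.Trainer",
    "benchmark": True,
    "precision": "16-mixed",
    "min_epochs": 10,
    "max_epochs": 50,
    "enable_model_summary": True,
    "num_sanity_val_steps": 0,
    "max_time": None,
    "accelerator": "auto",
    "devices": 1,
}

# YAML-style literals that Python's literal parser does not understand.
_YAML_LITERALS = {"true": True, "false": False, "null": None}


def apply_overrides(defaults, overrides):
    """Hypothetical helper: merge 'trainer.KWARG=VALUE' strings onto the defaults."""
    cfg = dict(defaults)  # copy, so the defaults stay untouched
    for item in overrides:
        key, _, raw = item.partition("=")
        group, _, kwarg = key.partition(".")
        if group != "trainer" or not kwarg:
            raise ValueError(f"not a trainer override: {item!r}")
        if raw.lower() in _YAML_LITERALS:
            value = _YAML_LITERALS[raw.lower()]
        else:
            try:
                value = ast.literal_eval(raw)  # numbers such as 100
            except (ValueError, SyntaxError):
                value = raw  # plain strings such as bf16-mixed or cpu
        cfg[kwarg] = value
    return cfg


def trainer_kwargs(cfg):
    """Hypothetical helper: drop _target_ so the rest can be passed as kwargs."""
    return {k: v for k, v in cfg.items() if k != "_target_"}


cfg = apply_overrides(DEFAULT_TRAINER, ["trainer.max_epochs=100", "trainer.precision=bf16-mixed"])
kwargs = trainer_kwargs(cfg)
# With lightning installed, the equivalent direct call would be:
#   lightning.Trainer(**kwargs)
```

Only the overridden keys change; every other flag keeps the default documented on this page.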

benchmark

default: True

precision

default: 16-mixed

min_epochs

default: 10

prevents early stopping and similar callbacks from interrupting the training until this number of epochs is reached
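
The interaction between min_epochs, max_epochs and a stop request can be sketched with a deliberately simplified model. This is not Lightning's actual training loop; the function epochs_run and its stop_requested_at parameter are hypothetical and only illustrate the rule described above, using the defaults from this page.

```python
def epochs_run(stop_requested_at, min_epochs=10, max_epochs=50):
    """Return how many epochs complete in this simplified model.

    stop_requested_at: epoch from which a callback (e.g. early stopping)
    keeps asking to stop, or None if no callback ever asks.
    """
    completed = 0
    while completed < max_epochs:  # max_epochs is a hard upper bound
        completed += 1  # stands in for one training epoch
        wants_stop = stop_requested_at is not None and completed >= stop_requested_at
        # A stop request is honored only once min_epochs have completed.
        if wants_stop and completed >= min_epochs:
            break
    return completed


print(epochs_run(3))     # 10: the request at epoch 3 is deferred until min_epochs
print(epochs_run(20))    # 20: past min_epochs, the request takes effect immediately
print(epochs_run(None))  # 50: without a stop request, training runs to max_epochs
```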

max_epochs

default: 50

enable_model_summary

default: true

num_sanity_val_steps

default: 0

max_time

default: null

accelerator

default: auto

determines the hardware accelerator; “auto” chooses based on the available hardware

devices

default: 1

number of hardware devices to use; mml is not yet optimized for multi-GPU usage