Contents
API
The optimizer config group determines the torch-optim optimizer used for backpropagation during model training. For now only a single optimizer and single parameter group is supported.
optimizer
torch-optim
The default optimizer by mml.
mml
Adam
the Adam optimizer
see Adam
coefficients used for computing running averages of gradient and its square
the initial learning rate
denominator summand for numerical stability
L2 penalty
SGD
stochastic gradient descent optimizer
see SGD
momentum factor
dampening for momentum