Stochastic Gradient Descent (SGD)

Stochastic Gradient Descent (SGD) #

SGD uses mini-batches to trade exact gradients for speed and generalisation.


Home | Nonlinear Optimisation