forward range of rows
one of the predefined loss functions
optimizer to use on all learnable layers for training
whether or not to show progress during training
degree of Hogwild parallelism
whether or not loss value should be tracked during training for monitoring (slightly slower)
Train neural network on a dataset, using a predefined loss.