What is a learning rate?
The learning rate controls how large each parameter update step is during training.
- Too high can overshoot
- Too low learns slowly
- Often tuned experimentally
What is a learning rate?
Tagged with training
The learning rate controls how large each parameter update step is during training.
What is a learning rate?
Gradient descent updates model parameters in the direction that reduces the loss function.
What is gradient descent?
Transfer learning starts from a pretrained model and adapts it to a new but related task.
What is transfer learning?