Topic cluster · 2 items

training

glossary_term

Fine-tuning

Further training a pretrained model on a smaller, specific dataset.

paper

Grokking in small transformers

When and why tiny models suddenly generalize long after overfitting.

Related topics