pre-training

initial training on vast text data to learn language patterns before task-specific fine-tuning

pre-training in a sentence

“Pre-training on internet text gives the model broad world knowledge.”

Latin prae- before + Old French trainer to draw, drag

Related Words

fine-tuning

additional training on specific data to adapt a pre-trained model for particular tasks

RLHF

reinforcement learning from human feedback—training models using human preference judgments

supervised learning

training on labeled examples where correct outputs are provided

self-supervised learning

training where labels are derived from the data itself, like predicting masked words

loss function

a mathematical measure of how wrong the model's predictions are, minimized during training

gradient descent

an optimization algorithm that iteratively adjusts parameters to minimize loss