
pre-training
initial training on vast text data to learn language patterns before task-specific fine-tuning
“Pre-training on internet text gives the model broad world knowledge.”
Origin: Latin prae- before + Old French trainer to draw, drag

initial training on vast text data to learn language patterns before task-specific fine-tuning
“Pre-training on internet text gives the model broad world knowledge.”
Origin: Latin prae- before + Old French trainer to draw, drag