
As opposed to inference (like generating text and images), training requires more compute and higher-precision math (fp16 or bf16), and a single CPU generally won't cut it.

The prepare/train/generate instructions in the GitHub repo linked are pretty much it for the 'how' of training a model. You give it a task and it does it for a billion trillion epochs, saving the changes incrementally (or not).
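To make the prepare/train/generate shape concrete, here's a toy sketch (not the linked repo's actual code): fit y = w*x by gradient descent, saving a "checkpoint" after each epoch. Real training is the same loop, just with billions of parameters and GPU math.

```python
# "prepare": a toy dataset where the true weight is 3
data = [(x, 3.0 * x) for x in range(1, 6)]
w = 0.0            # single model parameter, zero-initialized
lr = 0.01
checkpoints = []

for epoch in range(50):              # many passes over the data
    for x, y in data:                # "train": one gradient step per example
        grad = 2 * (w * x - y) * x   # d/dw of squared error (w*x - y)^2
        w -= lr * grad
    checkpoints.append(w)            # save changes incrementally

print(round(w, 2))                   # "generate": w*x now approximates 3*x
```

The incremental checkpoints are what let you resume training or roll back to an earlier state.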

Training a LoRA for an image model may be more approachable, since there are more blog posts on it, and the process is largely similar, except you're training a small set of low-rank adapter weights instead of the whole network.
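A rough sketch of the LoRA idea (my understanding of the general technique, not any specific library's API): freeze the big pretrained weight matrix W and learn a low-rank update A @ B, so the effective weight is W + A @ B. Only A and B, a tiny fraction of the parameters, get trained.

```python
def matmul(A, B):
    # plain-Python matrix multiply, just for the toy example
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

d, r = 4, 1                        # full dim 4, adapter rank 1
W = [[0.0] * d for _ in range(d)]  # frozen pretrained weight (toy: zeros)
A = [[1.0] for _ in range(d)]      # d x r matrix, trainable
B = [[0.5] * d]                    # r x d matrix, trainable

delta = matmul(A, B)               # low-rank update, d x d
W_eff = [[W[i][j] + delta[i][j] for j in range(d)] for i in range(d)]

# trainable params: d*r + r*d = 8, vs d*d = 16 for full fine-tuning
print(len(A) * len(A[0]) + len(B) * len(B[0]), d * d)
```

At real model sizes (d in the thousands, r around 4-64) the savings are far more dramatic, which is why a LoRA fits on consumer hardware when full fine-tuning doesn't.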

[edit] I'm also learning so correct me if I'm off, hn!



> You give it a task and it does it for 1 billion trillion epochs and saves the changes incrementally (or not).

Somewhat confusingly, big LLMs are mostly just trained for 1 epoch, afaik.


I've seen 3 epochs in some of the R1 fine-tuning blog posts. It's not my field, so I'm not sure how valid that is.


Yeah, fine-tuning is different from pretraining



