
Is it though? There is a reason GPT has Codex variants: RL on a specific task raises performance on that task.
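
A minimal sketch of what that kind of task-specific RL looks like, assuming a REINFORCE-style objective on a small causal LM (this is not OpenAI's actual Codex recipe; the model name and reward function are placeholders for illustration):

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_name = "gpt2"  # stand-in base model, not the real thing
    tok = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    opt = torch.optim.AdamW(model.parameters(), lr=1e-5)

    def reward(text: str) -> float:
        # Placeholder task reward, e.g. unit tests passing for code generation.
        return 1.0 if "def " in text else 0.0

    prompt = "Write a Python function that adds two numbers:\n"
    inputs = tok(prompt, return_tensors="pt")
    prompt_len = inputs["input_ids"].shape[1]

    for step in range(100):
        out = model.generate(**inputs, do_sample=True, max_new_tokens=40,
                             return_dict_in_generate=True)
        seq = out.sequences  # prompt + sampled completion
        r = reward(tok.decode(seq[0], skip_special_tokens=True))

        # Recompute log-probs of the sampled completion under the current policy.
        logits = model(seq).logits[:, :-1, :]
        logp = torch.log_softmax(logits, dim=-1)
        token_logp = logp.gather(-1, seq[:, 1:].unsqueeze(-1)).squeeze(-1)
        completion_logp = token_logp[:, prompt_len - 1:].sum()

        # REINFORCE: push probability mass toward completions the task rewards.
        loss = -r * completion_logp
        opt.zero_grad()
        loss.backward()
        opt.step()

The point of the sketch is only that the gradient signal comes entirely from the task reward, which is why the gains concentrate on that task.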




Post-training doesn't transfer when a new base model arrives, so anyone who adopted a task-specific LLM gets burned when a new generational advance comes out.

Resources permitting, if you are chasing the frontier on some more niche task, you redo your training regime on the new-generation LLMs.



