Who gets the next generation "up to speed" if the teachers are always forgetting?
Small models are already known to be more performative.
This is still just physics. The bigger the data set, the more likely you are to find false positives.
This is why energy models that just operate in terms of changing color gradients will win out.
LLMs are just a distraction for terminally online people.