It's actually common to be in that situation where the grid is paying pennies on the dollar and you have extra generation. Most grid-tie systems are in that boat.
Suddenly you find yourself looking for something to spend power on so it doesn't go to waste.
Is autocomplete using LLMs really useful? Even with frontier models I found it to be about 50% right, I turned it of and prefer to use IntelliJ built-in, it is way more reliable.
For me local models is all about quality, and how to achieve that - e.g. by providing guardrails that test the job done.
People are using 3090 (24GB) to run models, and it is the most cost effective way to run the. Yes, it is 2x faster, but memory wise you surely can spend 24gb on llm.
Also there are smaller, still usefull models that can run on 8GB or less.
As for oprncode, doesn't the system prompt eat too much of the context? Local models are really constraint in regards contex, and opencode AFAIR uses a 10k of it or some thing close.
AFAIR it is not clear, because they write it is "30 days, but ...":
> After 30 days, the data is deleted automatically, except in the rare cases where it's part of a safety investigation or we're legally required to keep it.
So you have a vague clause saying "when" and vague clause saying for "how long". If it will fly I would be surprised.
But why make an app when websites is enough? And I don't need to run n web browsers for that.
reply