More

solomatov · 2026-04-28T15:35:13 1777390513

It would have been better if they provided not just weights, but also some frontend where it is usable as is.

solomatov · 2026-04-20T18:37:04 1776710224

>but I have seen the local 122b model do smarter more correct things based on docs than opus

Could you please share more about this

alex7o · 2026-04-20T21:35:42 1776720942

Maybe a bit misleading. I have used in in two places.

One Is for local opencode coding and config of stuff the other is for agent-browser use and for both it did better (opus 4.6) for the thing I was testing atm. The problem with opus at the moment I tired it was overthinking and moving itself sometimes I the wrong direction (not that qwen does overthink sometimes). However sometimes less is more - maybe turning thinking down on opus would have helped me. Some people said that it is better to turn it of entirely when you start to impmenent code as it already knows what it needs to do it doesn't need more distraction.

Another example is my ghostty config I learned from queen that is has theme support - opus would always just make the theme in the main file

solomatov · 2026-04-16T22:58:06 1776380286

Just curious, the fixes are not about weights but about templates, am I right?

danielhanchen · 2026-04-17T08:27:25 1776414445

Yes so chat templates and the actual implementations

solomatov · 2026-04-16T17:34:10 1776360850

Did anyone try it and Gemma 4? Does it feel that it's better than Gemma 4?

solomatov · 2026-04-14T20:15:39 1776197739

Does anyone has any tips for starting with Gastown? I am comfortable with couple of agents running, but not yet comfortable with what Gastown offers.

peddling-brink · 2026-04-14T21:10:57 1776201057

Set a budget. Fund an openrouter account with the max you can stomach spending on this test and give it a shot.

At least, that’s what I would do, if I had any interest in testing out gastown with my own money. If my employer wants to pay for the testing, that’s another question entirely.

solomatov · 2026-04-14T21:11:45 1776201105

I mean not how to do it, it's not that hard, but how to be productive with it.

solomatov · 2026-04-14T00:35:13 1776126913

> more like supervising 8-15 agents

How do they do it? (My own record is 5 agents, but it is not typical). Do they use gastown or something?

azinman2 · 2026-04-14T00:37:29 1776127049

I often have 10+ running in parallel. I’m attacking parallel problems that aren’t interdependent. Sometimes adding additional products can bring me up to 15+.

Gotta have really good test harnesses so they can largely fix themselves.

solomatov · 2026-04-14T00:38:22 1776127102

But how do you cover such amount of multi tasking? Could you give an example? I mean what kind of tasks allow such a parallelization?

htrp · 2026-04-14T00:47:04 1776127624

context switching across the entirety of the feature surface for an app

You could easily have agents to work on login page, messaging feature, database/data model update, recommender system, backend api, etc

jondwillis · 2026-04-14T00:56:27 1776128187

We have our doubts about this. Can you share your code or product? Anecdotally, my mistakes and lack of understanding exponentiate the more I try to parallelize.

azinman2 · 2026-04-14T14:27:39 1776176859

Who is “we”?

As I said in the neighboring comment, for vibe coding side projects and prototypes for work I just merge and iterate. It works out more than it doesn’t. For anything bigger at work I cannot share as I’m at Apple.

solomatov · 2026-04-14T00:50:12 1776127812

But you have to keep it in your head, and remember all stuff at the same time. How is it possible to track, and do reviews one after another? Or are these pretty long running agents?

azinman2 · 2026-04-14T14:25:51 1776176751

I’m not sure what you mean by keep it in your head? I know all of the parts the agents are working on. It’ll often be a mix between bigger tasks (some large refactor, new feature, etc) and small tasks (little bug fixes).

For prototyping I just merge. I don’t bother to review the code. For anything more important than I am reviewing the code and going back and forth. Basically there’s a queue of stuff demanding my attention, and I just serially go through them.

What’s also been really helpful to me is /simplify and similar code review skills (I have my own). That alone takes an agent a while to parse through everything it’s done and self reviews. It catches quite a lot itself this way.

solomatov · 2026-04-14T16:42:42 1776184962

>I’m not sure what you mean by keep it in your head?

If the project I work on is large enough, it takes me some time to get everything I need to understand for review into the short term memory. If it's small enough, it's less of a problem for me.

aanet · 2026-04-14T01:45:59 1776131159

Honestly, I dont know. I could be mistaken about the exact number of agents - but not wrong about fact of AI-driven workflows which is heavily automated, and goes on for hours.

He's one (small) step from distinguished engineer, with 20+ patents to his name, and is an embedded programmer (largely C/C++) with 30+ years of experience in the field; and I've known him for nearly as long, so I put a lot of credence to his words.

But we don't usually talk work; he's the guitarist in our band :) [I'm the bass] So we mainly chill over music + beer. And lately, it's been less chill ¯\_(ツ)_/¯

solomatov · 2026-04-14T00:21:59 1776126119

Is there any publication which demonstrates that the improvement is really 10x?

ggm · 2026-04-14T00:29:25 1776126565

It's like "decimate" -you would think 10x had literal force, but it's more figurative. It just means "moar"

(decimate had specific literal intent. Now it's just a force modifier like bigly)

peterashford · 2026-04-14T00:37:37 1776127057

The literal meaning was removing 1/10

nemosaltat · 2026-04-14T00:45:48 1776127548

> Removing 1/10

feels euphemistic for the original “colloquial” usage I have for it.

> The killing of one in ten, chosen by lots, from a rebellious city or a mutinous army was a punishment sometimes used by the Romans. The word has been used (loosely and unetymologically, to the irritation of pedants) since 1660s for "destroy a large but indefinite number of." [0]

[0] https://www.etymonline.com/word/decimate

peterashford · 2026-04-14T08:26:47 1776155207

Yup. What amuses me is that people think that decimate is to massively degrade something. I assume they're thinking "reduce to 1/10th" rather than "reduce to 9/10th". The effect is markedly different

zetanor · 2026-04-14T00:55:29 1776128129

A watched pot never boils. A watched vibe coder never 10x-es.

solomatov · 2026-04-13T15:00:44 1776092444

What this crate could be used for?

aerzen · 2026-04-14T06:33:40 1776148420

For converting HTTP URLs into interactive images of the webpage.

In other words: an internet browser.

solomatov · 2026-04-10T00:44:19 1775781859

Could you recommend which quantization level to use with it?

solomatov · 2026-03-20T22:38:46 1774046326

Does github copilot ToS allow this?

fresh_broccoli · 2026-03-20T23:18:13 1774048693

They officially support OpenCode: https://github.blog/changelog/2026-01-16-github-copilot-now-...

edg5000 · 2026-03-21T05:37:23 1774071443

This is very interesting. This could allow custom harnesses to be used economically with Opus. Depending on the usage limits, this may be cheaper than their API.

swingboy · 2026-03-20T22:46:45 1774046805

I don't see why not. It's just using the Github Copilot API.