tobr · 2026-03-24T11:00:02 1774350002

Your argument that it’s a shallow argument is itself a shallow argument. “I hate x” is not a technical argument anyway; it’s an emotional assessment.

mexicocitinluez · 2026-03-24T20:20:12 1774383612

But they're shilling a technical solution, not an emotional one.

usrbinenv · 2026-03-24T20:30:47 1774384247

As much as we like to think of ourselves as rational beings, emotions are still a very large part of our decision making process. I didn't build Qite because I hate React, I built it because I knew exactly how I wanted things to work. But I do hate React and it's part of why I knew exactly how I wanted things to work.

mexicocitinluez · 2026-03-24T22:41:48 1774392108

> As much as we like to think of ourselves as rational beings, emotions are still a very large part of our decision making process

And yet, plenty of people all around the world are able to get traction for their products without mentioning the hate of another.

> I didn't build Qite because I hate React,

I get that React being the most popular front-end framework means it's going to get its fair share of criticism, but it's become pathetic the degree to which people have made hating it their personality, even going so far as to market their own frameworks in terms of their personal feelings towards it.

Nobody is saying humans aren't emotional; you're trying to deflect from being unable to disconnect your emotions from another library.

It's React Derangement Syndrome.

tobr · 2026-03-21T19:00:16 1774119616

Interesting to note how similar this seems to what happened with Benj Edwards at Ars Technica. AI was used to extract or summarize information, and quotes found in the summary were then used as source material for the final writing and never double checked against the actual source.

I’ve run into a similar problem myself - working with a big transcript, I asked an AI to pull out passages that related to a certain topic, and only because of oddities in the timestamps extracted did I realize that most of the quotes did not exist in the source at all.

raw_anon_1111 · 2026-03-21T20:16:06 1774124166

This seems like a solved problem. Any RAG interface I design has links to the original source and passage. Even NotebookLM does this.

mh- · 2026-03-21T20:17:39 1774124259

For the curious, the term of art is Grounding.

e.g.: https://docs.cloud.google.com/vertex-ai/generative-ai/docs/g...

tobr · 2026-03-21T21:21:33 1774128093

It might be a solved problem in the sense that it has a possible solution, but not in the sense that it doesn’t happen with the tools most people would expect to be able to handle the task.

Peritract · 2026-03-21T20:27:57 1774124877

It was already a solved problem with cmd/ctrl + f.
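A minimal sketch of that kind of check, assuming the extracted quotes and the source transcript are both available as plain strings. The function names and sample text are hypothetical, made up for illustration; the idea is simply an automated version of searching the source for each quote before using it.

```python
import re


def normalize(text: str) -> str:
    """Collapse whitespace and lowercase, so line wraps don't cause false misses."""
    return re.sub(r"\s+", " ", text).strip().lower()


def find_unsupported_quotes(quotes, source):
    """Return the quotes that do not occur verbatim (after normalization) in source."""
    haystack = normalize(source)
    return [q for q in quotes if normalize(q) not in haystack]


# Hypothetical transcript and AI-extracted quotes, for illustration only.
transcript = "We shipped the parser in March.\nIt was slower than we hoped."
quotes = ["shipped the parser in March", "It was faster than we hoped"]

unsupported = find_unsupported_quotes(quotes, transcript)
print(unsupported)  # ['It was faster than we hoped']
```

An exact-substring check like this only catches quotes that are missing or altered; it says nothing about whether a real quote is used in a misleading context.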

tobr · 2026-03-16T17:07:35 1773680855

Microwaves are for heating, ovens are for cooking. Obviously it’s possible to live on only microwaved food but it sounds pretty miserable.

tobr · 2026-03-16T14:24:01 1773671041

Part 1 discussion, December 2024: https://news.ycombinator.com/item?id=42343953

tobr · 2026-03-14T19:21:12 1773516072

Meanwhile, iPhone is still using this design https://xkcd.com/1884/

tobr · 2026-03-14T08:43:24 1773477804

Describing what computers do as “thinking” is not new. It’s a useful and obvious metaphor. https://www.gutenberg.org/ebooks/68991

ldng · 2026-03-14T13:09:45 1773493785

It is a deceitful metaphor.

tobr · 2026-03-12T07:57:06 1773302226

That’s an interesting approach, but what do you learn from it that is applicable to the next task? Do you find that this eventually boils down to heuristics that generalize to any task? It sounds like it would only work because you already put a lot of effort into understanding the constraints of the specific problem in detail.

tobr · 2026-03-12T07:44:26 1773301466

I wonder why they fail in this specific way. If you just let them do stuff, everything quickly turns to spaghetti. They seem to overlook obvious opportunities to simplify things, or to see a pattern and follow through. The default seems to be to add more, rather than rework or adjust what’s already in place.

samdjstephens · 2026-03-12T08:46:15 1773305175

I suspect it has something to do with a) the average quality of code in open source repos and b) the way the reward signal is applied in RL post-training - does the model face consequences of a brittle implementation for a task?

I wonder if these RL runs can extend over multiple sequential evaluations, where poor design in an early task hampers performance later on, as measured by amount of tokens required to add new functionality without breaking existing functionality.

foo42 · 2026-03-12T11:04:41 1773313481

Yeah, I've been wondering if the increasing amount of coding RL is going to draw models towards very short-term goals, relative to just learning from open source code in the wild.

catlifeonmars · 2026-03-12T12:58:07 1773320287

To me this seems like a natural consequence of the next-token prediction model. In one particular prompt you can’t “backtrack” once you’ve emitted a token. You can only move forwards. You can iteratively refine (e.g. the agent can one-shot itself repeatedly), but the underlying mechanism is still present.

I can’t speak for all humans, but I tend to code “nonlinearly”, jumping back and forth and typically going from high level (signatures, type definitions) to low level (fill in function bodies). I also do a lot of deletion as I decide that actually one function isn’t needed or if I find a simpler way to phrase a particular section.

Edit: in fact, thinking on this more, code is _much_ closer to a tree than to a sequence of tokens. Not sure what to do with that, except maybe to try a tree-based generator which iteratively adds child nodes.
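To illustrate the tree point, here is a small sketch using Python's standard `ast` module to print the node structure of a two-line function. The sample function and the `outline` helper are hypothetical, chosen only for illustration; this shows the tree a parser recovers from the text, not how any model generates code.

```python
import ast

# Hypothetical two-line function, used only as sample input.
source = "def add(a, b):\n    return a + b\n"
tree = ast.parse(source)


def outline(node, depth=0):
    """Return one indented line per AST node, depth-first."""
    lines = ["  " * depth + type(node).__name__]
    for child in ast.iter_child_nodes(node):
        lines.extend(outline(child, depth + 1))
    return lines


tree_lines = outline(tree)
print("\n".join(tree_lines))
```

The signature (`FunctionDef`, `arguments`) sits near the root and the body (`Return`, `BinOp`) hangs below it, which matches the "signatures first, bodies later" order described above.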

tobr · 2026-03-12T15:06:59 1773328019

This would make sense to me as an explanation when it only outputs code. (And I think it explains why code often ends up subtly mangled when moved in a refactoring: where a human would copy and paste, the agent instead has to “retype” it and often ends up slightly changing formatting, comments, identifiers, etc.)

But for the most part, it’s spending more tokens on analysis and planning than pure code output, and that’s where these problems need to be caught.

catlifeonmars · 2026-03-14T15:48:20 1773503300

I feel like planning is also inherently not sequential. Typically you plan in broad strokes, then recursively jump in and fill in the details. On the surface it doesn’t seem to be all that much different than codegen. Code is just more highly specified planning. Maybe I’m misunderstanding your point?

OtomotO · 2026-03-12T07:48:36 1773301716

All it does is generate soup. Some of which may taste good.

There is no thinking, no matter what marketing tells you.

Antibabelic · 2026-03-12T09:07:12 1773306432

LLMs are next token predictors. Their core functionality boils down to simply adding more stuff.

logicchains · 2026-03-12T12:29:13 1773318553

They do what you tell them to. If you regularly tell them to look for opportunities to clean up/refactor the code, they will.

tobr · 2026-03-11T11:06:28 1773227188

I also expected hardware to be involved. But in the context of a list of tutorials on how to use this live coding tool, the title makes sense.

tobr · 2026-03-09T10:21:57 1773051717

And at the same time, the fastest growing consumer product of all time is called “ChatGPT”.

jl6 · 2026-03-09T10:47:36 1773053256

Perhaps if the product is compelling enough, the name doesn’t matter - and conversely, if the product is borderline, it had better have a great name.

jmogly · 2026-03-09T11:08:20 1773054500

ChatGPT is a great name though: you “chat” with the “GPT”, so it’s self-informing (even if you don’t know what a GPT is), and it’s 4 syllables that roll off the tongue well together.

RSS has no vowels, no information, and looks like an alphabet term you might see at the doctor’s office or in an HR onboarding form at a corpo.

wiether · 2026-03-09T15:03:05 1773068585

Randos are just calling it "Chat" now.

"I'll ask Chat about x!"

msephton · 2026-03-10T01:28:53 1773106133

In Japan it's now known colloquially as 「チャッピー」 ("Chappy" or "Chappie"). High praise that it has received such a shortened and personified version so quickly.

tobr · 2026-03-09T15:15:56 1773069356

It’s the new “I looked it up on wiki”.

youniverse · 2026-03-09T23:02:13 1773097333

I've heard 'just ai it' from high schoolers.