That’s false. Larger LLMs learn token decompositions through their training, and in fact modern training pipelines are designed to occasionally produce uncommon tokenizations (including splitting words into individual characters) for this reason. Frontier models have no trouble spelling words even without tools. Even many mid-sized models can do that.
Wait, where can I learn more about this? I don't doubt that varying the tokenization during training improves results, but how does/would that enable token introspection?
Because LLMs can learn from training context that different token sequences represent the same character sequence, just as they learn much more complex patterns from context.
You can try this out locally with any mid-sized current-gen LLM. You’ll find that it can spell out most atomic tokens from its input just fine. It simply learned to do so.
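If you want to see the kind of tokenization variation being described, here's a minimal sketch using SentencePiece's subword regularization (Kudo, 2018), one published technique of this sort; the toy corpus, vocab size, and sampling parameters are invented purely for illustration:

    import sentencepiece as spm

    # Toy corpus and vocab size, chosen only to keep the example self-contained.
    corpus = ["the quick brown fox jumps over the lazy dog"] * 200
    spm.SentencePieceTrainer.train(
        sentence_iterator=iter(corpus), model_prefix="toy",
        vocab_size=40, model_type="unigram")

    sp = spm.SentencePieceProcessor(model_file="toy.model")
    for _ in range(3):
        # enable_sampling draws a different segmentation on each call, so
        # training data built this way shows the model many different token
        # decompositions of the same character sequence.
        print(sp.encode("the quick fox", out_type=str,
                        enable_sampling=True, alpha=0.1, nbest_size=-1))

Each print will show a different split of the same string; seeing enough of those pairs is what lets a model associate distinct token sequences with one underlying character sequence.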
> (so infrastructure for clean water and all chemicals)
Fabs are among the most complex chemical engineering sites in the world, handling some of the most dangerous substances there are. So don't underestimate the complexity of this part.
Every year I ask the latest version of ChatGPT a basic factual question about rugby results. It almost always gets it wrong - even when it does a web search and cites sources. Wrong scores, hallucinated matches, wrong locations - just gobsmacking amounts of wrongness.
The latest "Thinking" version gets it reliably right, but it spent about 3 minutes coming up with an answer that 10 seconds of googling provides.
So I don't believe LLMs are currently an effective replacement for search engines.
VisiCalc was "the" killer app for early micros, but being able to edit text on screen and then print it with letter-quality output was nothing to sneeze at, either. That was plausibly a key efficiency gain for the service sector, perhaps comparable to the 10-25% now being talked about re: LLMs (which is huge on a secular basis).