OP says one query uses 0.3 Wh. Driving an electric car 10 miles uses about 3,000 Wh, which at ~33 mph works out to roughly 10,000 Wh per hour.
I'm not sure how many queries an hour of Claude Code use is equivalent to, but say one query every 5 seconds: that's 720 queries, so an hour of continuous use = ~216 Wh, or ~50x less than the electric car.
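The back-of-envelope arithmetic above can be sketched like this (every number here is one of the commenter's assumptions, not a measurement; the ~33 mph speed is just what's implied by "10,000 Wh per hour"):

```python
# Agent side: OP's per-query estimate times an assumed query rate.
WH_PER_QUERY = 0.3           # OP's figure for one query
QUERIES_PER_HOUR = 3600 / 5  # assume one query every 5 seconds

agent_wh_per_hour = WH_PER_QUERY * QUERIES_PER_HOUR  # ~216 Wh

# Car side: 3,000 Wh per 10 miles, at the speed that yields ~10,000 Wh/h.
EV_WH_PER_MILE = 300
EV_SPEED_MPH = 33
ev_wh_per_hour = EV_WH_PER_MILE * EV_SPEED_MPH       # ~9,900 Wh

print(agent_wh_per_hour, ev_wh_per_hour / agent_wh_per_hour)
```

The ratio comes out around 46, i.e. the "~50x less than an electric car" in the comment.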
A coding agent runs near-constantly, so of course it'd require a lot more compute than running even, say, a multi-minute query with a thinking model every hour. How much exactly is pretty hard to calculate because it requires some guesswork, but...
For a long input of n tokens to a model with N active parameters, the cost should scale as O(N n^2) due to computing attention; for non-massive n, the O(N n) term dominates, which is why API costs per token are flat up to a certain context length and then start to rise. From the estimates in [1], it's around 40 Wh for n = 100k, N = 100B. I multiply by 2.5 to account for Opus probably being ~2.5x larger than gpt-4o, and by another 2 to pessimistically assume we're always close to Opus's soft context limit of 200k (it's possible to get a bigger context for extra cost, but I suspect people compact aggressively to avoid needing it). That gets me 7.2 J/t, which at a rough throughput estimate of 20 t/s gives a power of 144 W. Like a powerful CPU or a mediocre GPU, and still orders of magnitude lower than a car.
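Here is the same estimate spelled out step by step. The 40 Wh base figure is from the linked estimate [1]; the 2.5x and 2x multipliers and the 20 t/s throughput are the commenter's guesses, not measured values:

```python
# Per-token energy estimate, following the comment above.
BASE_WH = 40          # [1]: ~40 Wh for a 100k-token context, N ~ 100B active
SIZE_FACTOR = 2.5     # guess: Opus ~2.5x larger than gpt-4o
CONTEXT_FACTOR = 2    # pessimistic: always near the 200k soft context limit

wh_per_100k_tokens = BASE_WH * SIZE_FACTOR * CONTEXT_FACTOR  # 200 Wh
joules_per_token = wh_per_100k_tokens * 3600 / 100_000       # 7.2 J/t

THROUGHPUT_TPS = 20   # rough decode-throughput guess
power_watts = joules_per_token * THROUGHPUT_TPS              # ~144 W

print(joules_per_token, power_watts)
```

So the whole chain is just 40 × 2.5 × 2 = 200 Wh per 100k tokens, converted to joules per token and multiplied by tokens per second.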
I think this is a useful way to look at things. We often point out that LLMs are not conscious because of x, but we tend to forget that we don't really know what consciousness is, nor do we really know what intelligence is beyond the Justice Potter Stewart definition. It's helpful to occasionally remind ourselves how much uncertainty is involved here.
> I think agents should manage their own context too.
My intuition is that this should be almost trivial. If I copy/paste your long coding session into an LLM and ask it which parts can be removed from context without losing much, I'm confident that it will know to remove the debugging bits.
I generally do this when the agent gets stuck in a test loop (or similar) after I've injected some later requirement and tweaked things. Once I hit a decent place, I have the agent summarize, discard the branch (it's part of the context too!), and start fresh with the new prompt.
In my experience virtually every magazine is like this, not just Quanta. I open an article hoping to learn something about some scientific or mathematical discovery, but instead the article is almost entirely about the discoverer.
For learning about actual discoveries, YouTube is much better (Veritasium, Numberphile, 3Blue1Brown, ...).
> Yes it was a pragmatic change, no it was not a change in their values. The commentary here on HN about Anthropic's RSP change was completely off the mark. They "think these changes are the right thing for reducing AI risk, both from Anthropic and from other companies if they make similar changes", as stated in this detailed discussion by Holden Karnofsky, who takes "significant responsibility for this change":
Can you imagine a world where Anthropic says "we are changing our RSP; we think this increases AI risk, but we want to make more money"?
The fact that they claim the new RSP reduces risk gives us approximately zero evidence that the new RSP reduces risk.
That misses my point: the evidence is the extensive argumentation provided for why it reduces risk. To quote Karnofsky:
> I wish people simply evaluated whether the changes seem good on the merits, without starting from a strong presumption that the mere fact of changes is either a bad thing or a fine thing. It should be hard to change good policies for bad reasons, not hard to change all policies for any reason.
It's both. Saving time is a form of status signaling. Professionalism usually entails spending longer on something than is optimal for effective communication, which is a way of signaling "my time is less valuable than yours". Writing short messages with grammatical errors is a way of signaling "my time is more valuable than your comprehension".
> On the positive side of this, research papers by competent people read very clearly with readable sentences, while those who are afraid that their content doesn't quite cut it, litter it with jargon, long complicated sentences, hoping that by making things hard, they will look smart.
I often find that to be true. Another important factor is that research skill is correlated with writing skill. Someone who's at the top of their field is likely to be talented in other ways, too, and one such talent is making complex topics easier to understand.