
energy123 · 2026-02-04T17:29:32 1770226172

It converges on conventional attention as P goes up

energy123 · 2026-02-04T16:20:56 1770222056

It's like claims of room-temperature superconductors or Millennium Prize solutions. Earth-shattering if true. It'd be such a black swan. Terrible for Nvidia.

SeanAnderson · 2026-02-04T16:49:39 1770223779

Well, we solved one of the Millennium Prize problems (honestly kinda quickly) so maybe there's hope :)

energy123 · 2026-02-04T15:27:11 1770218831

> this is where the taylor expression would fail to represent the values well.

"In practice, we find that four Taylor terms (P = 4) suffice for recovering conventional attention with elementwise errors of approximately the same magnitude as Float16 resolution"

seanhunter · 2026-02-04T15:49:17 1770220157

I read that too, but I wondered whether elementwise error is the right metric. Surely the actual error metric should be to evaluate model performance for a conventional transformer model and then the same model with the attention mechanism replaced by this 4th order Taylor approximation?

vlovich123 · 2026-02-04T16:27:46 1770222466

Bounding the elementwise error of the weights is, by definition, a stricter evaluation criterion than “performance” metrics obtained by running the model.

ehsanu1 · 2026-02-04T20:06:10 1770235570

To spell it out for myself and others: approaching equivalent calculations for each individual attention block means we also approach equivalent performance for the combination of them. And with an error bar approaching floating point accuracy, the performance should be practically identical to regular attention. Elementwise errors of this magnitude can't lead to any noteworthy changes in the overall result, especially given how robust LLM networks seem to be to small deviations.
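
The convergence claim in this subthread can be checked on a toy case. What follows is only a minimal sketch, not the paper's construction: it assumes the approximation amounts to replacing `exp` in the softmax with its Taylor series truncated to P terms, and that the attention scores stay within [-1, 1]. The names `taylor_exp`, `attention_weights`, and `max_elementwise_error` are hypothetical helpers introduced for illustration.

```python
import math
import random

def taylor_exp(x, num_terms):
    """Taylor series of exp(x) around 0, truncated to num_terms terms (orders 0..num_terms-1)."""
    total, term = 0.0, 1.0
    for k in range(num_terms):
        total += term          # add x^k / k!
        term *= x / (k + 1)    # advance to x^(k+1) / (k+1)!
    return total

def attention_weights(scores, exp_fn):
    """Softmax-style normalization of one row of scores using the given exp function."""
    weights = [exp_fn(s) for s in scores]
    norm = sum(weights)
    return [w / norm for w in weights]

def max_elementwise_error(scores, num_terms):
    """Largest absolute gap between exact and Taylor-approximated attention weights."""
    exact = attention_weights(scores, math.exp)
    approx = attention_weights(scores, lambda s: taylor_exp(s, num_terms))
    return max(abs(a - b) for a, b in zip(exact, approx))

# Assumption: one row of 64 scores, bounded in [-1, 1] (hypothetical toy data).
random.seed(0)
scores = [random.uniform(-1.0, 1.0) for _ in range(64)]
errors = {p: max_elementwise_error(scores, p) for p in (1, 2, 4, 8, 16)}

for p, err in errors.items():
    print(f"P={p:2d}  max elementwise error = {err:.3e}")
```

On bounded scores the error shrinks quickly as P grows, which matches the "converges as P goes up" comment. It says nothing about end-to-end model quality, which is the question seanhunter raises, and the picture would change if scores were allowed to grow large.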

energy123 · 2026-02-04T13:40:32 1770212432

> 5. Energy production will be cheaper from earth

Sun-synchronous orbit means solar panels collect the same amount 24/7. I guess that's the #1 benefit. Cheap energy.

donkey_brains · 2026-02-04T13:57:24 1770213444

Read the whole sentence. He’s talking about the cost to make solar panels that can be deployed in space, not the efficacy of said panels.

energy123 · 2026-02-04T12:59:02 1770209942

There was some credible analysis that I don't have the link to which estimated 50% gross margins for OpenAI, largely eaten up by operational expenses. So not awful unit economics, but not good either.

Assuming that's even true, the big asterisk is uncertainty around efficiency gains in the future. The intelligence divided by cost ratio is changing very quickly. It is hard to make confident predictions more than 3 months out.

energy123 · 2026-02-04T07:37:11 1770190631

What in particular is wrong/misleading in the Starcloud whitepaper, then?

https://starcloudinc.github.io/wp.pdf

beloch · 2026-02-04T07:55:03 1770191703

In Table 1, the cost of cooling of a terrestrial data centre is listed as $7M. The cost of cooling in space is assigned a value of $0 with the claim:

"More efficient cooling architecture taking advantage of higher ΔT in space"

My bold claim: The cost of cooling will not be $0. The cost of launching that cooling into space will also not be $0. The cost of maintaining that mechanically complex cooling in space will not be $0.

They then throw in enough unrealistic calculations later in the "paper" to show that they thought about the actual cost at least a little bit. Apparently just enough to conclude that it's so massive there's no way they're going to list it in the table. Table 1 is pure fantasy.

WithinReason · 2026-02-04T08:51:13 1770195073

That row specifically says "chiller energy cost" which is 0

trymas · 2026-02-04T07:56:59 1770191819

Previous discussions on HN:

- https://news.ycombinator.com/item?id=44390781

- https://news.ycombinator.com/item?id=45667458

- https://news.ycombinator.com/item?id=43977188

I will not re-read them, but what I recall from those threads is that the numbers don't make sense. Something like:

- radiators multiple square kilometers in size, in space;

- lifting the necessary payloads to space would take orders of magnitude more launch capacity than the whole world has now;

- maintenance nightmare: yes, you can have redundancy, but there is no feasible way to do maintenance;

- compare how much effort/energy/maintenance is required to have ISS or Tiangong space stations - these space datacenters sound ridiculous;

NB: I would be happy to be proven wrong. There are many things that are possible if we invest effort (and money) into them, akin to JFK's "We choose to go to the Moon" talk. It sounded incredible, but it went from nearly zero to a Moon landing in ~7 years. Though as far as I understand, napkin math for space data centers at this scale seems to need efforts that are orders of magnitude greater than the Apollo program, i.e. launching a Saturn V multiple times per day for years. Even with booster reuse technology this seems literally incredible (not to mention fuel/material costs).
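
For the radiator bullet above, the napkin math can be sketched with the Stefan-Boltzmann law, P = εσAT⁴. This is a rough sketch under loud assumptions that do not come from the whitepaper or the earlier threads: a hypothetical 1 GW heat load, a radiator surface at 300 K with emissivity 0.9, radiating from one side only, and no absorbed sunlight or Earth infrared. `radiator_area_m2` is an illustrative helper name.

```python
STEFAN_BOLTZMANN = 5.670374419e-8  # W / (m^2 K^4)

def radiator_area_m2(heat_watts, temp_kelvin, emissivity=0.9, sides=1):
    """Area needed to radiate heat_watts to deep space, ignoring any absorbed heat."""
    flux_per_side = emissivity * STEFAN_BOLTZMANN * temp_kelvin ** 4  # W/m^2
    return heat_watts / (flux_per_side * sides)

# Assumption: 1 GW of waste heat, 300 K radiator (hypothetical illustration values).
area_km2 = radiator_area_m2(1e9, 300.0) / 1e6
print(f"radiator area: {area_km2:.2f} km^2")

# Rejected heat scales with T^4, so a hotter radiator shrinks the area sharply.
hot_area_km2 = radiator_area_m2(1e9, 600.0) / 1e6
print(f"at 600 K: {hot_area_km2:.2f} km^2")
```

Under these assumptions a gigawatt-class load needs roughly 2.4 km² of single-sided radiator, consistent with the "multiple square kilometers" recollection. Radiating from both sides or running the radiator hotter reduces the area; absorbed sunlight increases it.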

red75prime · 2026-02-04T13:00:01 1770210001

A giant space datacenter with square kilometers of solar panels doesn't make sense. A cluster of Starlink-sized satellites, which orbit near each other(1) and which are connected using laser-links might make sense.

(1) There are orbital arrangements that allow satellites to stay close together with minimal orbital corrections. Scott Manley mentioned this in one of his videos.

trymas · 2026-02-04T13:07:34 1770210454

Sounds like we would want to elevate from water wasting on Earth to pollution in space.

mmoustafa · 2026-02-04T07:49:07 1770191347

They do not at any point outline how cooling will be done; they simply say "it will be more efficient than chillers due to the larger delta T", which is incorrect because it's about dT, not delta T.

deepfriedchokes · 2026-02-04T07:46:42 1770191202

Probably this bit on page 4, which parent comment addresses: “More efficient cooling architecture taking advantage of higher ΔT in space.”

energy123 · 2026-02-04T03:40:31 1770176431

Does that emit more than Elon's terrestrial data centers powered by natural gas, per unit of compute?

energy123 · 2026-02-04T02:48:21 1770173301

Debunked by Steven Pinker:

https://whyevolutionistrue.com/2019/01/31/is-the-world-reall...

ashivkum · 2026-02-04T03:00:29 1770174029

Debunked by Jason Hickel:

https://www.jasonhickel.org/blog/2019/2/3/pinker-and-global-...

energy123 · 2026-02-04T03:11:09 1770174669

Hickel's thoughts on this matter were torn to shreds on HN back in 2019; I'm not going to relitigate it now.

ashivkum · 2026-02-04T03:22:35 1770175355

I've seen Pinker's arguments dismantled too. The blog whose post we're commenting on even has a piece dismantling the totally made up GDP numbers coming out of Africa.

energy123 · 2026-02-03T13:51:06 1770126666

Double the codex limits for 2 months is very compelling. The limits are already generous.

energy123 · 2026-02-03T11:29:14 1770118154

I want to see regulation of the algorithm. Something like forcing a chronological feed, or somehow nerfing the recommendation engine. Figure out a way to make it boring, bypass the whole censorship debate.

andy81 · 2026-02-03T11:32:28 1770118348

There are more options -

Banning advertising targeted at the user rather than the context

Enforcing Do Not Track

Enforcing GDPR (especially sites that use cookie banners)