More

sbarre · 2025-12-09T13:18:59 1765286339

To be fair, the whole job market has changed. Layoffs and the death of "a job for life" is not unique to IBM.

I think the pace of progress and innovation has, for better or worse, meant that companies can no longer count on successfully evolving only from the inside through re-training and promotions over the average employee's entire career arc (let's say 30 years).

The reality is that too many people who seek out jobs in huge companies like IBM are not looking to constantly re-invent themselves and learn new things and keep pushing themselves into new areas every 5-10 years (or less), which is table stakes now for tech companies that want to stay relevant.

ownagefool · 2025-12-09T14:33:43 1765290823

Honestly, I think that's people reacting to the market more than it's the market reacting to people.

If your average zoomer had the ability to get a job for life that paid comparably well by a company that would look after them, I don't think loyalty would be an issue.

The problem is today, sticking with a company typically means below market reward, which is particularly acute given the ongoing cost of living crises affecting the west.

sbarre · 2025-12-09T02:43:20 1765248200

Is this just your belief presented as fact, or do you have some data to back this up?

(Not the literacy stat but the fact that illiterate people "figure out how to use tech by memorizing the icons and locations of buttons").

Arch485 · 2025-12-09T03:35:45 1765251345

Well, if you're unable to read, you're not going to figure out what the buttons do by reading the textual labels :p

Further, if you have difficulty reading, it's easier to parse the meaning of an abstract symbol, so you'd use that instead of a textual label when available. (I say this as someone who is a really slow reader. I use icons when I can)

mercury4063 · 2025-12-09T19:41:13 1765309273

Watch a small child use a computer.

sbarre · 2025-12-05T14:29:40 1764944980

I got a 4k 55" TV for $299 earlier this year. It weighs maybe 10lbs, and is super thin and fits on the wall.

Large 4k TVs being this accessible/affordable for most households has not been an option for "decades"..

Retric · 2025-12-05T14:35:29 1764945329

Screen size makes little difference for an individual they can just sit closer, viewing angels are the problem for a family where 55” doesn’t cut it.

4k also makes little difference here, most people really don’t care as seen by how many people use simple HD vs 4k streaming.

dpark · 2025-12-05T14:41:49 1764945709

> Screen size makes little difference for an individual they can just sit closer

This is silly. Most people don’t want to sit in a chair 3 feet from their TV to make it fill more of their visual area. A large number of people are also not watching movies individually. I watch TV with my family far more than I watch alone.

Retric · 2025-12-05T14:49:29 1764946169

> This is silly.

Tell that to every streaming on their tablets sitting on their stomachs. People even watch movies on their phones but they aren’t holding them 15’ away.

Also you don’t need to sit 3’ from a 37” TV.

dpark · 2025-12-05T15:10:17 1764947417

No one says the experience of watching on their tablet matches the experience of watching a movie in the theater.

But this isn’t the point. TVs are furniture. People generally have a spot where the TV naturally fits in the room regardless of its size. No one buys a TV and then arranges the rest of their furniture to sit close enough to fill their visual space. If the couch is 8 feet from the TV, it’s 8 feet from the TV.

Retric · 2025-12-05T15:12:24 1764947544

People watching their tablet on a couch in from of a 55+” TV with a surround sound speaker system says on some level it’s a better experience. I’ve seen plenty of people do this to say it’s common behavior.

> No one buys a TV and then arranges the rest of their furniture to sit close enough to fill their visual space. If the couch is 8 feet from the TV, it’s 8 feet from the TV.

It’s common on open floor plans / large rooms for a couch to end up in a completely arbitrary distance from a TV rather than next to a wall. Further setting up the TV on the width vs length vs diagonal of a room commonly provides two or more options for viewing distance.

wooger · 2025-12-11T12:05:44 1765454744

Maybe half this, based on the living room photos I see online is that people have their tv mounted multiple feet too high for comfortable watching.

dpark · 2025-12-05T17:07:03 1764954423

> People watching their tablet on a couch in from of a 55+” TV with a surround sound speaker system says on some level it’s a better experience.

It’s a more private/personal experience. Turning on the TV means everyone watches.

> It’s common on open floor plans / large rooms for a couch to end up in a completely arbitrary distance from a TV rather than next to a wall. Further setting up the TV on the width vs length vs diagonal of a room commonly provides two or more options for viewing distance.

You’re essentially arguing that people can arrange their furniture for the best viewing experience. Which is true, but also not what people actually do.

The set of people willing to arrange their furniture for the best movie watching experience in their home are the least likely to buy a small TV.

Retric · 2025-12-05T18:10:43 1764958243

> Turning in the TV means everyone watches.

People still do this while home alone, you’re attacking a straw man.

> least likely to buy a small TV.

People can only buy what actually exists. My point was large TV’s “have been out for decades they really aren’t a replacement” people owning them still went to the moves.

dpark · 2025-12-05T18:22:06 1764958926

> People still do this while home alone, you’re attacking a straw man.

Maybe? You’re making blind assertions with no data. I have no idea how frequently the average person sits in front of their 60” TV by themselves and watches a movie on their tablet. My guess is not very often but again, I have no data on this.

> My point was large TV’s “have been out for decades they really aren’t a replacement” people owning them still went to the moves.

And we come back to the beginning where your assertion is true but also misleading.

Most people have a large tv in their homes today. Most people did not have this two decades ago, despite then being available.

The stats agree. TV sizes have grown significantly.

https://www.statista.com/chart/3780/tv-screen-size/?srsltid=...

Retric · 2025-12-05T18:49:26 1764960566

> Maybe? You’re making blind assertions with no data.

I’ve seen or talked to more than five people doing it (IE called them, showed up at their house, etc) and even more people mentioned doing the same when I asked. That’s plenty of examples to say it’s fairly common behavior even if I can’t give you exact percentages.

Convince vs using the TV remove was mentioned, but if it’s not worth using the remote it’s definitely not worth going to the moves.

averageRoyalty · 2025-12-06T21:19:30 1765055970

> I have no idea how frequently the average person sits in front of their 60” TV by themselves and watches a movie on their tablet.

If you want some anecdota, I do this regularly. If I'm watching something and I may have to move somewhere in the house during, it's just practical.

nevertoolate · 2025-12-05T19:07:18 1764961638

I do. I’ve researched the optimal distance for a smallish tv screen (which fits between the studio monitor stand). I move the tv closer when watching a film, it stands on hacked together wooden box like thing which has some yoga tools and film magazines in it - it has wheels. Crazy stuff. There is a flipchart like drawing of my daughter covering the tv normally which we flip when watching films.

vharish · 2025-12-05T16:00:53 1764950453

Living rooms are not that big to start with. I don't think you actually asked anyone's opinion on this! :D

Small TVs are not comfortable to watch. No one I know is okay with getting a smaller TV and moving their sofa closer. That sounds ridiculous. If there's any comfort to this capatilistic economy, it is the availability of technology at throw away prices. Most people would rather spend on a TV than save the money.

As for the theatre being obsolete, I do agree with you, atleast to some extent. I think everyone is right here. All factors combined is what makes going to the theatre not worth the effort for most of the movies. It's just another nice thing, not what it used to be.

Also, the generational difference too. I think teen and adolescents have a lot of ways to entertain themselves. The craze for movies isn't the same as it used to be. And we grew old(er). With age, I've grown to be very picky with movies.

Retric · 2025-12-05T16:20:01 1764951601

37+” isn’t a small TVs. Resolution was an issue in the 90’s but midsized TV’s have been around for a long time.

Also, I see plenty of people use tablets to watch stuff laying on the couch in front of a big screen TV. So viewing distance is plenty relevant.

sbarre · 2025-12-05T14:26:58 1764944818

You're describing TV and movies since forever.

Ever year there are a few good shows and movies and a lot of mid-to-bad shows and movies.

This is not a Netflix thing, nor is it new.

HDThoreaun · 2025-12-05T19:31:42 1764963102

Just not true with HBO. Most of their content is consistently pretty good

triceratops · 2025-12-06T00:11:11 1764979871

HBO is expensive and most people don't have it. Ergo most people never see or hear about their lower quality content. Only the good stuff that their rich friends rave about.

https://en.wikipedia.org/wiki/List_of_HBO_original_programmi...

I don't recognize half the titles on that page.

They also make less content overall. This makes sense because they are one TV channel and assume you can get your reality TV fix somewhere else.

Netflix wants to be the only thing you watch. So they have to serve all needs.

HDThoreaun · 2025-12-06T00:16:46 1764980206

You not recognizing their shows doesnt mean they are bad. Ive seen most of those and the overwhelming majority are at least solid. I understand netflix's business model, Im just annoyed that theyre buying HBO because they will likely make it worse. Maybe netflix wants more prestige content and will let HBO be HBO but I doubt it.

sbarre · 2025-12-05T14:25:33 1764944733

Endgame: Netflix renames itself to HBO

sbarre · 2025-12-05T14:24:07 1764944647

Yeah until Netflix adds tiered pricing for content and you end up paying more than what Netflix + HBO Max together would have cost because Netflix is the only game in town for that content..

I think like all media consolidation this will send a lot of people back to the seven seas..

autoexec · 2025-12-05T14:48:42 1764946122

The seven seas can't stop netflix from canceling good shows though.

sbarre · 2025-11-30T02:09:51 1764468591

Honest question: given all the companies and people working on anti-cheat systems for the last 20+ years of multiplayer video games, don't you think it would all be server-side if it could be, by now?

smolder · 2025-11-30T05:14:34 1764479674

No, game companies are simply unwilling to pay for the talent and man hours that it takes to police their games for cheaters. Even when they are scanning your memory and filesystem they don't catch people running the latest rented cheat software.

thenthenthen · 2025-11-30T05:23:08 1764480188

Cheating is a social problem, not a technical issue. Just give the community dedicated server possibility (remember how back in the days games used to ship with dedicated server binaries?) and the community can police for free! Wow!

smolder · 2025-11-30T05:31:19 1764480679

Yes, I would also prefer that servers were community run as in the hl2 days.

I would still argue that there are technical issues leading to some amount of cheating. In extraction shooters like Hunt Showdown, Escape From Tarkov and a few others, people can run pcie devices that rip player location and other information from the machines memory in order to inject it into an overlay with a 2nd computer, and they do go to these lengths to cheat, giving them a huge advantage. It wouldn't be possible to rip that info from memory for these "ESP cheats" if the server didn't needlessly transmit position information for players that aren't actually visible. IMO this is a technical failure. There are other steps that could be taken as well, which just aren't because they're hard.

ThatPlayer · 2025-11-30T06:33:54 1764484434

Yes, because players want to spend time moderating other players instead of playing the game. Sounds fun!

Community servers literally invented anti-cheat. All current big name anti-cheats started as anti-cheats for community servers. And admins would choose to use them. Game developers would see that and integrate it. Quake 3 Arena even added Punkbuster in a patch.

Modern community servers like FiveM for GTAV, or Face-It and ESEA for CS2 have more anti-cheats, not less.

smolder · 2025-12-01T06:41:37 1764571297

This is a misunderstanding of what community is, said by someone who doesn't know.

d3Xt3r · 2025-11-30T02:58:53 1764471533

No, because most companies will make decisions based on time/effort/profitability, and because client-side anticheat is stupid simple and cheap, that's what they go with. Why waste their own server resources, when they can waste the user's?

sbarre · 2025-11-30T03:33:39 1764473619

Alright then, sounds like you've got it all figured out.

Covenant0028 · 2025-11-30T07:13:19 1764486799

So it is the company prioritising their bottom line at the expense of their customer's computers. More simply, they move cost from their balance sheet and convert it into risk on the customer's end.

Which is actively customer-abusive behavior and customers should treat it with the contempt it deserves. The fact that customers don't, is what enables such abuse.

sbarre · 2025-11-30T13:22:12 1764508932

This is such a weird take. In an online multiplayer game the cheaters are the risk to the company's bottom line.

If a game is rampant with cheaters, honest paying players stop showing up, and less new players sign up. The relatively small percentage of cheaters cost the company tons of sales and revenue.

It is actively in a company's best interest to do everything they possibly can to prevent cheating, so the idea that intentionally building sub-par anti-cheat is about "prioritising their bottom line" seems totally absurd to me.

Not to mention these abstract "the company" positions completely ignore all the passionate people who actually make video games, and how much most of them care about fair play and providing a good experience to their customers. No one hates cheaters more than game developers.

Covenant0028 · 2025-12-05T02:23:06 1764901386

I'll quote what the person I responded to said:

> because most companies will make decisions based on time/effort/profitability, and because client-side anticheat is stupid simple and cheap, that's what they go with. Why waste their own server resources, when they can waste the user's?

And my comment was a response to that statement. In context of that statement, companies are indeed choosing to prioritise their commercial interests in a way that increases the risk to the computers of their customers.

> Not to mention these abstract "the company" positions completely ignore all the passionate people who actually make video games

Irrelevant. Companies and their employees are two different distinct entities and a statement made about one does not automatically implicate the other. Claiming, for example, that Ubisoft enables a consistent culture of sexual harassment does not mean random employees of that company are automatically labeled as harassers.

Coming to anti-cheat, go ahead and fight them all you want. That's not a problem. Demanding the right to introduce a security backdoor into your customer's machines in order to do that, is the problem.

sbarre · 2025-11-29T13:33:08 1764423188

> pretty much entirely just generalizations of their own experience, but phrased as if they're objective truth

I mean you're describing 90% of blog and forum posts on the Internet here.

This (IMO - so it's not ironic) is the biggest leap most people need to make to become more self-aware and to better parse the world around them: recognizing there is rarely an objective truth in most matters, and the idea that "my truth is not your truth, both can be different yet equally valid" (again in most cases, not all cases).

saghm · 2025-11-29T19:39:32 1764445172

I think my issue is that the blog post comes across to me as in essence an argument that the person communicating shouldn't be dissuaded by potential reactions to what they say, but it fails to account for the difference between good-faith and bad-faith reactions. There's a huge difference between a bad-faith misinterpretation and a good-faith misunderstanding in my opinion, as the latter seems to come just as often from a failure on the part of the communicator to be clear as from any fault on the listener. It's hard for me not to get the impression that the author either can't or doesn't seen the value in differentiating between those cases based on the fact there's such significant room for improvement in clarifying their views in their paragraph about remote work, which is why I called it out.

sbarre · 2025-11-30T02:13:45 1764468825

Totally fair, I agree with your take on the blog post for sure.

sbarre · 2025-11-27T15:15:24 1764256524

A question I don't see addressed in all these articles: what prevents Nvidia from doing the same thing and iterating on their more general-purpose GPU towards a more focused TPU-like chip as well, if that turns out to be what the market really wants.

timmg · 2025-11-27T15:34:22 1764257662

They will, I'm sure.

The big difference is that Google is both the chip designer *and* the AI company. So they get both sets of profits.

Both Google and Nvidia contract TSMC for chips. Then Nvidia sells them at a huge profit. Then OpenAI (for example) buys them at that inflated rate and them puts them into production.

So while Nvidia is "selling shovels", Google is making their own shovels and has their own mines.

pzo · 2025-11-27T19:37:29 1764272249

on top of that Google is also cloud infrastructure provider - contrary to OpenAI that need to have someone like Azure plug those GPUs and host servers.

throwawayffffas · 2025-11-28T00:55:01 1764291301

I am pretty sure OpenAI has data centers of its own.

veunes · 2025-11-28T11:03:01 1764327781

The own shovels for own mines strategy has a hidden downside: isolation. NVIDIA sells shovels to everyone - OpenAI, Meta, xAI, Microsoft - and gets feedback from the entire market. They see where the industry is heading faster than Google, which is stewing in its own juices. While Google optimizes TPUs for current Google tasks (Gemini, Search), NVIDIA optimizes GPUs for all possible future tasks. In an era of rapid change, the market's hive mind usually beats closed vertical integration.

1980phipsi · 2025-11-27T15:35:47 1764257747

Aka vertical integration.

ForHackernews · 2025-11-28T00:02:10 1764288130

Selling shovels may still turn out to be the right move: Nvidia got rich off the cryptocurrency bubble, now they're getting even richer off the AI bubble.

Having your own mines only pays off if you actually do strike gold. So far AI undercuts Google's profitable search ads, and loses money for OpenAI.

sagarm · 2025-11-27T22:28:32 1764282512

> AI ... profits

Citation needed. But the vertical integration is likely valuable right now, especially with NVidia being supply constrained.

m4rtink · 2025-11-27T17:17:22 1764263842

So when the bubble pops the companies making the shovels (TSMC, NVIDIA) might still have the money they got for their products and some of the ex-AI companies might least be able to sell standard compliant GPUs on the wider market.

And Google will end up with lots of useless super specialized custom hardware.

skybrian · 2025-11-27T19:19:26 1764271166

It seems unlikely that large matrix multipliers will become useless. If nothing else, Google uses AI extensively internally. It already did in ways that weren’t user-visible long before the current AI boom. Also, they can still put AI overviews on search pages regardless of what the stock market does. They’re not as bad as they used to be, and I expect they’ll improve.

Even if TPU’s weren’t all that useful, they still own the data centers and can upgrade equipment, or not. They paid for the hardware out of their large pile of cash, so it’s not debt overhang.

Another issue is loss of revenue. Google cloud revenue is currently 15% of their total, so still not that much. The stock market is counting on it continuing to increase, though.

If the stock market crashes, Google’s stock price will go down too, and that could be a very good time to buy, much like it was in 2008. There’s been a spectacular increase since then, the best investment I ever made. (Repeating that is unlikely, though.)

nutjob2 · 2025-11-27T20:45:58 1764276358

How could Google's custom hardware become useless? They've used it for their business for years now and will do so for years into the future. It's not like their hardware is LLM specific. Google cannot lose with their vast infrastructure.

Meanwhile OpenAI et al dumping GPUs while everyone else is doing the same will get pennies on the dollar. It's exactly the opposite to what you describe.

I hope that comes to pass, because I'll be ready to scoop up cheap GPUs and servers.

qcnguy · 2025-11-27T21:27:33 1764278853

Same way cloud hardware always risks becoming useless. The newer hardware is so much better you can't afford to not upgrade, e.g. an algorithmic improvement that can be run on CUDA devices but not on existing TPUs, which changes the economics of AI.

timmg · 2025-11-27T18:39:37 1764268777

> And Google will end up with lots of useless super specialized custom hardware.

If it gets to the point where this hardware is useless (I doubt it), yes Google will have it sitting there. But it will have cost Google less to build that hardware than any of the companies who built on Nvidia.

UncleOxidant · 2025-11-27T19:23:22 1764271402

Right, and the inevitable bubble pop will just slow things down for a few years - it's not like those TPUs will suddenly be useless, Google will still have them deployed, it's just that instead of upgrading to a newer TPU they'll stay with the older ones longer. It seems like Google will experience much less repercussions when the bubble pops compared to Nvidia, OpenAI, Anthropic, Oracle etc. as they're largely staying out of the money circles between those companies.

heisenbit · 2025-11-27T21:43:58 1764279838

And running loads long term profitable may require both lower power use as well as longer chip lifetimes - something associated with lower power use.

immibis · 2025-11-27T18:53:40 1764269620

aka Google will have less of a pile of money than Nvidia will

kolbe · 2025-11-27T19:29:00 1764271740

Alphabet is the most profitable company in the world. For all the criticisms you can throw at Google, lacking a pile of money isn't one of them.

acoustics · 2025-11-27T17:54:19 1764266059

I think people are confusing the bubble popping with AI being over. When the dot-com bubble popped, it's not like internet infrastructure immediately became useless and worthless.

iamtheworstdev · 2025-11-27T18:54:19 1764269659

that's actually not all that true... a lot of fiber that had been laid went dark, or was never lit, and was hoarded by telecoms in an intentional supply constrained market in order to drive up the usage cost of what was lit.

pksebben · 2025-11-27T20:19:12 1764274752

If it was hoarded by anyone, then by definition not useless OR worthless. Also, you are currently on the internet if you're reading this, so the point kinda stands.

ithkuil · 2025-11-27T19:44:31 1764272671

Are you saying that the internet business didn't grow a lot after the bubble popped?

bryanlarsen · 2025-11-27T20:08:08 1764274088

And then they sold it to Google who lit it up.

blinding-streak · 2025-11-28T04:56:03 1764305763

Google uses TPUs for its internal AI work (training Gemini for example), which surely isn't decreasing in demand or usage as their portfolio and product footprint increases. So I have a feeling they'd be able to put those TPUs to good use?

Workaccount2 · 2025-11-27T15:56:22 1764258982

Deepmind gets to work directly with the TPU team to make custom modifications and designs specifically for deepmind projects. They get to make pickaxes that are made exactly for the mine they are working.

Everyone using Nvidia hardware has a lot of overlap in requirements, but they also all have enough architectural differences that they won't be able to match Google.

OpenAI announced they will be designing their own chips, exactly for this reason, but that also becomes another extremely capital intensive investment for them.

This also doesn't get into that Google also already has S-tier dataceters and datacenter construction/management capabilities.

wood_spirit · 2025-11-27T18:55:18 1764269718

Isn’t there a suspicion that OpenAI buying custom chips from another Sam Altman venture is just graft? Wasn’t that one of the things that came up when the board tried to out him?

saagarjha · 2025-11-28T01:06:54 1764292014

The chips are being done in-house.

overfeed · 2025-11-28T04:11:29 1764303089

It was only brought in-house after the $5,000,000,000,000 self-dealing AI chip venture failed to launch.

saagarjha · 2025-11-28T15:36:17 1764344177

Nvidia?

overfeed · 2025-11-29T03:51:57 1764388317

Sam Altman attempted to raise $5Tn for an AI-chip startup

saagarjha · 2025-11-29T12:38:27 1764419907

Link? I only know of Rain and they raised <<$1 billion IIRC

overfeed · 2025-11-29T19:39:02 1764445142

Note that I said "attempted". https://www.wsj.com/tech/ai/sam-altman-seeks-trillions-of-do...

saagarjha · 2025-11-30T14:38:57 1764513537

I think this was to build datacenters

01100011 · 2025-11-28T08:07:14 1764317234

> Deepmind gets to work directly with the TPU team to make custom modifications

You don't think Nvidia has field-service engineers and applications engineers with their big customers? Come on man. There is quite a bit of dialogue between the big players and the chipmaker.

Workaccount2 · 2025-11-28T15:15:47 1764342947

They do, but they need to appease a dozen different teams from a dozen different labs, forcing nvidia to take general approaches and/or dictating approaches and pigeonholing labs into using those methods.

Deepmind can do whatever they want, and get the exact hardware to match it. It's a massive advantage when you can discover a bespoke way of running a filter, and you can get a hardware implementation of it without having to share that with any third parties. If OpenAI takes a new find to Nvidia, everyone else using Nvidia chips gets it too.

01100011 · 2025-11-28T19:27:12 1764358032

This ignores the way it often works: Customer comes to NVDA with a problem and NVDA comes up with a solution. This solution now adds value for every customer.

In your example, if OpenAI makes a massive new find they aren't taking it to NVDA.

Nvidia has the advantage of a broad base of customers that gives it a lot of information on what needs work and it tries to quickly respond to those deficiencies.

Workaccount2 · 2025-11-29T01:16:45 1764379005

>In your example, if OpenAI makes a massive new find they aren't taking it to NVDA.

Right, and therefore they are stuck doing it in software, while google can do it in hardware.

jauntywundrkind · 2025-11-27T19:27:19 1764271639

Nvidia doesn't have the software stack to do a TPU.

They could make a systolic array TPU and software, perhaps. But it would mean abandoning 18 years of CUDA.

The top post right now is talking about TPU's colossal advantage in scaling & throughput. Ironwood is massively bigger & faster than what Nvidia is shooting for, already. And that's a huge advantage. But imo that is a replicateable win. Throw gobs more at networking and scaling and nvidia could do similar with their architecture.

The architectural win of what TPU is more interesting. Google sort of has a working super powerful Connection Machine CM-1. The systolic array is a lot of (semi-)independent machines that communicate with nearby chips. There's incredible work going on to figure out how to map problems onto these arrays.

Where-as on a GPU, main memory is used to transfer intermediary results. It doesn't really matter who picks up work, there's lots of worklets with equal access time to that bit of main memory. The actual situation is a little more nuanced (even in consumer gpu's there's really multiple different main memories, which creates some locality), but there's much less need for data locality in the GPU, and much much much much tighter needs, the whole premise of the TPU is to exploit data locality. Because sending data to a neighbor is cheap, sending storing and retrieving data from memory is slower and much more energy intense.

CUDA takes advantage of, relies strongly on the GPU's reliance in main memory being (somewhat) globally accessible. There's plenty of workloads folks do in CUDA that would never work on TPU, on these much more specialized data-passing systolic arrays. That's why TPUs are so amazing, because they are much more constrained devices, that require so much more careful workload planning, to get the work to flow across the 2D array of the chip.

Google's work on projects like XLA and IREE is a wonderful & glorious general pursuit of how to map these big crazy machine learning pipelines down onto specific hardware. Nvidia could make their own or join forces here. And perhaps they will. But the CUDA moat would have to be left behind.

zzzoom · 2025-11-28T00:22:22 1764289342

> They could make a systolic array TPU and software, perhaps. But it would mean abandoning 18 years of CUDA.

Tensor cores are specialized and have CUDA support.

jauntywundrkind · 2025-11-28T02:04:47 1764295487

Tensor cores can help a lot for matrix maths, sure, definitely. They made a big splash in 2017 & have been essential. https://developer.nvidia.com/blog/programming-tensor-cores-c...

But it's still something grafted onto the existing architecture, of many grids with many blocks with many warps, and lots and lots of coordination and passing intermediary results around. It's only a 4x4x4 unit, afaik. There's still a lot of main memory being used to combine data, a lot of orchestration among the different warps and blocks and grids, to get big matrices crunched.

The systolic array is designed to allow much more fire and forget operations. It's inputs are 128 x 128 and each cell is its own compute node basically, shuffling data through and across (but not transitting a far off memory).

TPU architecture has plenty of limitations. It's not great at everything. But if you can design work to flow from cell to neighboring cell, you can crunch very sizable chunks of data with amazing data locality. The efficiency there is unparalleled.

Nvidia would need a radical change of their architecture to get anything like the massive data locality wins a systolic array can do. It would come with massively more constraints too.

Would love if anyone else has recommended reading. I have this piece earmarked. https://henryhmko.github.io/posts/tpu/tpu.html https://news.ycombinator.com/item?id=44342977

HarHarVeryFunny · 2025-11-27T15:43:58 1764258238

It's not that the TPU is better than an NVidia GPU, it's just that it's cheaper since it doesn't have a fat NVidia markup applied, and is also better vertically integrated since it was designed/specified by Google for Google.

UncleOxidant · 2025-11-27T19:29:25 1764271765

TPUs are also cheaper because GPUs need to be more general purpose whereas TPUs are designed with a focus on LLM workloads meaning there's not wasted silicon. Nothing's there that doesn't need to be there. The potential downside would be if a significantly different architecture arises that would be difficult for TPUs to handle and easier for GPUs (given their more general purpose). But even then Google could probably pivot fairly quickly to a different TPU design.

mr_toad · 2025-11-28T14:11:40 1764339100

The T in TPU stands for tensor, which in this context is just a fancy matrix. These days both are optimised for matrix algebra, i.e. general ML workloads, not just LLMs.

If LLMs become unfashionable they’ll still be good for other ML tasks like image recognition.

numbers_guy · 2025-11-27T15:25:13 1764257113

Nothing in principle. But Huang probably doesn't believe in hyper specializing their chips at this stage because it's unlikely that the compute demands of 2035 are something we can predict today. For a counterpoint, Jim Keller took Tenstorrent in the opposite direction. Their chips are also very efficient, but even more general purpose than NVIDIA chips.

mindv0rtex · 2025-11-27T18:06:39 1764266799

How is Tenstorrent h/w more general purpose than NVIDIA chips? TT hardware is only good for matmuls and some elementwise operations, and plain sucks for anything else. Their software is abysmal.

ezekiel68 · 2025-11-28T06:04:43 1764309883

Of course there's the general purpose RISC V CPU controller component but also, each NPU is designed in troikas that have one core reading data in, one core performing the actual kernel work, and the third core forwarding data out.

llm_nerd · 2025-11-27T15:27:43 1764257263

For users buying H200s for AI workloads, the "ASIC" tensor cores deliver the overwhelming bulk of performance. So they already do this, and have been since Volta in 2017.

To put it into perspective, the tensor cores deliver about 2,000 TFLOPs of FP8, and half that for FP16, and this is all tensor FMA/MAC (comprising the bulk of compute for AI workloads). The CUDA cores -- the rest of the GPU -- deliver more in the 70 TFLOP range.

So if data centres are buying nvidia hardware for AI, they already are buying focused TPU chips that almost incidentally have some other hardware that can do some other stuff.

I mean, GPUs still have a lot of non-tensor general uses in the sciences, finance, etc, and TPUs don't touch that, but yes a lot of nvidia GPUs are being sold as a focused TPU-like chip.

sorenjan · 2025-11-27T15:38:48 1764257928

Is it the Cuda cores that run the vertex/fragment/etc shaders in normal GPUs? Where does the ray tracing units fit in? How much of a modern Nvidia GPU is general purpose vs specialized to graphics pipelines?

qcnguy · 2025-11-27T21:30:41 1764279041

A datacenter GPU has next to nothing left related to graphics. You can't use them to render graphics. It's a pure computational kernel machine.

fooker · 2025-11-27T15:26:12 1764257172

That's exactly what Nvidia is doing with tensor cores.

bjourne · 2025-11-27T15:43:51 1764258231

Except the native width of Tensor Cores are about 8-32 (depending on scalar type), whereas the width of TPUs is up to 256. The difference in scale is massive.

neilmovva · 2025-12-01T01:15:24 1764551724

I think Hopper's native matmul tile is 64x64, and Blackwell is 128x128.

see this blog for a reference on Blackwell:

https://hazyresearch.stanford.edu/blog/2025-03-15-tk-blackwe...

fooker · 2025-11-28T03:35:23 1764300923

If it turns out to be useful, Nvidia can't just tweak a parameter in their verilog and declare victory?

If not, what's fundamentally difficult about doing 32 vs 256 here?

saagarjha · 2025-11-28T01:07:50 1764292070

Nobody cares about width; they care about TFLOPs.

LogicFailsMe · 2025-11-27T15:33:13 1764257593

That's pretty much what they've been doing incrementally with the data center line of GPUs versus GeForce since 2017. Currently, the data center GPUs now have up to 6 times the performance at matrix math of the GeForce chips and much more memory. Nvidia has managed to stay one tape out away from addressing any competitors so far.

The real challenge is getting the TPU to do more general purpose computation. But that doesn't make for as good a story. And the point about Google arbitrarily raising the prices as soon as they think they have the upper hand is good old fashioned capitalism in action.

blibble · 2025-11-27T15:18:35 1764256715

the entire organisation has been built over the last 25 years to produce GPUs

turning a giant lumbering ship around is not easy

sbarre · 2025-11-27T15:21:18 1764256878

For sure, I did not mean to imply they could do it quickly or easily, but I have to assume that internally at Nvidia there's already work happening to figure out "can we make chips that are better for AI and cheaper/easier to make than GPUs?"

coredog64 · 2025-11-27T20:01:19 1764273679

Isn't that a bit like Kodak knowing that digital cameras were a thing but not wanting to jeopardize their film business?

sofixa · 2025-11-27T15:19:15 1764256755

> what prevents Nvidia from doing the same thing and iterating on their more general-purpose GPU towards a more focused TPU-like chip as well, if that turns out to be what the market really wants.

Nothing prevents them per se, but it would risk cannibalising their highly profitable (IIRC 50% margin) higher end cards.

baron816 · 2025-11-27T22:59:27 1764284367

It’s not binary. It’s not existential. What’s at stake for Nvidia is its HUGE profit margins. 5 years from now, Nvidia could be selling 100x as many chips. But its market cap could be a fraction of what it is now if competition is so intense that its making 5% profit margin instead of 90%.

storus · 2025-11-28T02:35:17 1764297317

More like 900% right now.

torginus · 2025-11-28T00:50:16 1764291016

My personal guess would be what drives the cost and size of these chips is the memory bandwidth and the transcievers required to support it. Since transcievers/memory controllers are on the edge of the chip, you get a certain minimum circumference for a given bandwidth, which determines your min surface area.

It might be even 'free' to fill it with more complicated logic (especially one that allows you write clever algorithms that let you save on bandwidth).

sojuz151 · 2025-11-27T15:35:45 1764257745

They lose the competitive advantage. They have nothing more to offer than what Google has in-house.

sbarre · 2025-11-27T00:38:55 1764203935

I definitely do not return to a hotel where the bathroom was sub-par...

And likewise I absolutely return to a hotel where the bathroom was good when going back to a city.

I'm mostly talking about the water pressure for the shower here, but you get the idea.