Hacker News | system2's comments

Until I can buy an 80GB VRAM GPU, I won't attempt to do it. A local LLM is always missing something that needs a bigger model.

Which model class requires an 80 GB VRAM GPU? From my perspective, popular models seem to fall into two groups: those in the ~30B range (Qwen3.6, Gemma 4), and the larger models (MiniMax, MiMo, StepFun, Deepseek) with multiple hundreds of billions of parameters, for which 80 GB is simply too small.

You can just about reach the lower end of the latter category with a 128GB machine like a DGX Spark, Framework Desktop, or M5 Max, though those are usually not super fast. For the former category, you can easily run them fast with something like a 3090 or 5090, hell, probably even a 5060 Ti.
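To put rough numbers on the two categories above, here is a back-of-the-envelope sketch. It counts weights only and ignores KV cache, activations, and runtime overhead, so real requirements are higher. The 30B and 200B parameter counts and the 4-bit quantization are illustrative assumptions, not figures for any specific model named here.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes).

    Assumption: every parameter is stored at the same bit width.
    KV cache, activations, and framework overhead are NOT included.
    """
    return params_billion * bits_per_weight / 8


def weights_fit(params_billion: float, bits_per_weight: float, memory_gb: float) -> bool:
    """True if the weights alone fit in the given amount of memory."""
    return weights_gb(params_billion, bits_per_weight) <= memory_gb


if __name__ == "__main__":
    # Hypothetical model sizes (billions of parameters), for illustration only.
    for params in (30, 200):
        size = weights_gb(params, 4)  # 4-bit quantization assumed
        print(f"{params}B @ 4-bit: ~{size:.0f} GB of weights, "
              f"fits in 80 GB: {weights_fit(params, 4, 80)}, "
              f"fits in 128 GB: {weights_fit(params, 4, 128)}")
```

Under these assumptions a ~30B model needs about 15 GB for weights, while a 200B model needs about 100 GB, which is over 80 GB but under 128 GB.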


Video models.

This is true. There's not much point in buying only one RTX 6000. You need at least two to run anything interesting that you couldn't run on a 5090. And you can imagine where it goes from there.

Is bypassing the router a good idea?

Yes, if you want to. Routers are a necessary abstraction from the IPv4 days, and it seems they will stick around for a long time, so we sometimes need solutions that work around those topologies.

Are you conflating a router with SNAT? Routers as in L3 routing are not an "IPv4 only abstraction."

Yes, I used "router" in place of NAT, the way most casual home users would, which is presumably what the user above originally meant.

I think even Anthropic is very happy about it. It makes them look very advanced. But we all can see this is fake drama.

I am certain this is hype. Tomorrow, they can release Opus 4.9 and claim it is 99.99% close to Fable.

We definitely reached the available capability plateau. You are 100% correct IMHO.

Wait a few weeks. They won't be able to generate enough without it; it will get reversed and things will just continue as normal.

LLMs are plaguing the internet as usual. This site is proof of what a non-technical, inexperienced person comes up with through vibe coding.

With Claude Code in less than 5 minutes, I can come up with something 10x better, at least usable, with basic UX knowledge and flow basics.

Sorry if I offended the author, but he/she can learn from these comments.


People burning tokens for the most beginner HTML/CSS problems and writing about it is concerning.

We are at the point where AI starts to seriously impact abilities. Sure, a 2-line CSS fix is the solution, but the human “behind the wheel” has already prompted 6 times and gotten 80% there. It’s been “easy” thus far. No shot they are going to FINALLY look at and edit the code. It’s just one more prompt and the agent will probably fix it, right?

It’s wild. I’ve been in that situation: 80% into a project I COULD probably take over, but realistically? Two more lines of prompting could fix it. It’s too easy to avoid the hard work of understanding the code, logic, architecture, etc.


Well, the solution is incorrect. The problem seems to be that the CSS does not normalize to box-sizing: border-box;, among other things. The author's bad prompt probably sent Fable down the wrong rabbit hole.
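For anyone who wants the arithmetic behind the box-sizing point, here is a minimal sketch of how the two modes compute an element's rendered width. The 300px container, 16px padding, and 1px border are made-up numbers, not values from the page being discussed.

```python
def rendered_width(width: int, padding: int, border: int,
                   box_sizing: str = "content-box") -> int:
    """Horizontal space an element occupies, in px.

    Assumes equal left/right padding and border (a simplification).
    content-box: `width` is the content area; padding and border are added.
    border-box:  `width` already includes padding and border.
    """
    extras = 2 * padding + 2 * border
    if box_sizing == "border-box":
        # The content area cannot go negative, so the box is at least `extras` wide.
        return max(width, extras)
    return width + extras


if __name__ == "__main__":
    container = 300  # hypothetical parent width in px
    for mode in ("content-box", "border-box"):
        w = rendered_width(container, padding=16, border=1, box_sizing=mode)
        print(f"{mode}: child is {w}px wide, overflows parent: {w > container}")
```

With the default content-box, a child set to the parent's full width plus padding and border ends up wider than the parent, which is one common way a stray horizontal scrollbar appears.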

I dunno about beginner, I've been doing HTML+CSS for a few decades and I still find bugs where Safari differs from Chrome+Firefox pretty hard to figure out.

Wouldn't it be easier and better to just copy the HTML div and describe what was happening instead of sending a screenshot? Typically, these scrollbars appear because of a nested div with dynamic unrestricted width and/or overflow.

No wonder people burn through tokens.


A Pentium 100 couldn't even play Quake2 properly. You probably mean the Pentium 2 series.

A Pentium 1 at 133 MHz ran Quake2 pretty darn well as long as you had hardware accel. Without hardware accel it was ass.

(maybe even Pentium 100)


Their mission is to make money and become a government watchdog.
