Hacker Newsnew | past | comments | ask | show | jobs | submit | bestcommentslogin
Most-upvoted comments of the last 24 hours.

DeepSeek continues to not only push the boundaries but also publish these incredible papers explaining how they achieved their gains - something the American labs no longer do unfortunately. Chinese labs are doing the most interesting work in AI right now.

I hope this doesn't become the new norm where government becomes the bottleneck for innovation in the AI space.

It's worrying that with no formal and transparent policy framework that the government will be picking winners and losers and stifling innovation.

There's been no public policy, executive order, legislation, or otherwise on this, I wonder if anyone has filed FOIA requests for these decisions or the conversations between the Executive Branch and AI companies.


Easily the most interesting part of this announcement is buried in the second to last paragraph:

"We're also launching GPT‑5.6 Sol on Cerebras at up to 750 tokens per second in July, bringing frontier intelligence to customers at unprecedented speed. Access will initially be limited to select customers as we expand capacity."

750 tokens/s on a frontier model is going to be extremely interesting. I doubt this new version is anything but a version bump in terms of capabilities but if we can start getting these answers back faster, they end up being more useful.

Just off the top of my head, I can think of the tedious task of finding certain functionality within a codebase. I usually can't beat an AI agent harness at this task today. If the AI model is 3x faster I have less of chance.


This is regulatory capture in action. This will make it hard/impossible for new vendors to come into the market and only established companies will get to play, and charge, for LLMs. What does this mean for open source? Will it become illegal to download weights? What about train your own? Are we heading to a world where GPU use is regulated to ensure that illegal LLMs aren't being processed on your machine? More broadly though, how will this stop anyone but average people? Countries outside the us will completely ignore this and keep developing and moving ahead. Maybe Europe will adopt similar things but the genie is out. I can train insainly powerful models on my laptop. If you want to stop LLMs with legislation you can't do it like this.

Indeed, I find quite ironic that some people in tech in the US complain about EU "regulations first" approach, but then their government seem to arbitrarily stop things from being released because, well, there is no established policy on safety guarantees or other similar aspects.

All: for comments on the policy side please go to this related thread:

U.S. government will decide who gets to use GPT-5.6 - https://news.ycombinator.com/item?id=48690101


> […] the publisher posted a blank white page with the cryptic phrase, “This article has been withdrawn due to article violation.” Springer Nature is nevertheless still selling the empty PDF for $39.95.

completely unsurprised, given the state of online papers publishing. if you don’t have an subscription or aren’t an organisation member, the fees are insane


Next time someone tells you this is the party of free market and small government, I guess you just laugh now?

>Imagine the WH dislikes the CEO of a biotech company, while appreciating the attitude of a competitor CEO.

there is no need to imagine, this is what is literally happening


That is very very funny, and oh so plausible.

I enjoyed this bit a lot from the timeline

> Karen Oyelaran finds the payload by reading the source code with her eyes and files a second issue. The triage assistant closes it as “duplicate of #8814.” Issue #8814 is a feature request for dark mode. Karen reopens it. The assistant closes it. Karen reopens it. Karen’s GitHub account is rate-limited for “patterns consistent with automated behaviour.”

And this - the final sentence is a perfect indictment of the timeline we are in.

> Two AI review agents from competing vendors, both attached to a downstream pull request bumping foxhole-lz4, enter a disagreement loop over whether the package is malicious. After 340 comments and $41,255 in inference spend, Finance revokes both API keys; one vendor’s marketing team, cc’d on the cost anomaly alert, issues a press release citing “a 430% YoY increase in adversarial multi-agent security reasoning.” The stock opens up 6%.

I'm joining the goat farming waitlist ;-)


> Only companies approved by the government will get access. There is no process for individual users to get access to the new model.

I knew the time would come when individuals on personal subscriptions get the short end of the stick. Didn't think it would come so soon. I hope we're not too badly deprecated in the months to come.

Looks like I've got to improve my DeepSeek workflows.


Given how the WH operates these days, this is ripe for corruption. Imagine the WH dislikes the CEO of a biotech company, while appreciating the attitude of a competitor CEO. What is to stop them from stalling on giving acess for the latest model to the company they don't like?

IMHO, the biggest problem with the future of open weights models is that currently, open weights models are the result of philanthropy by some private org. (e.g. DeepSeek).

The spigot can be turned off at any time.

Until there's some sort of "community owned hardware", open weights models are always at risk of being discontinued.


My kindergartner has a 3D printer.

I got a call from the school principal. She said “another parent called and said your son 3D printed a gun and brought it to school”.

I looked at the print history. It was a tiny toy mandalorian figurine holding a blaster pistol in his hand.

I bought my son a bigger 3D printer and told him to stop playing with that boy.


Here is a trend I'm noticing:

- GPT-5 mini costs $0.25/$2 and will be discontinued in December.

- GPT-5.4 mini costs $0.75/$4.5 and is supposed to be the replacement.

- GPT-5.4 nano costs $0.2/$1.25 and, while it ranks better in benchmarks than GPT-5 mini, it's not even close when you test it in real scenarios.

So you're left being forced to go to GPT 5.4 mini if you use 5 mini today.

The same thing is happening here as their “Luna“ model will cost $1/$6.

Can't we just stay with the models we actually want? I don't need GPT 5.4 mini. GPT-5 does the job.

Maybe it’s the realization that it was never that cheap in the first place and they're forcing us to upgrade in a slow and painful way.



I can tell you, based on local examples, that politicians are setting up deals to bring in data centers without trying to build community support first. Not only that, they are often signing NDAs that prohibit them from telling voters what they have agreed to. It's no way to operate in a democracy, and voters are right to be angry.

I don't buy that anymore. The day America threatened to invade Canada and Denmark was the day America showed they cannot be trusted any more.

It's not like China can be trusted either, but China isn't planning any direct invasions to the west. Taiwan, perhaps, but they're playing a long-term tactical game rather than a "invade the country we don't like this week" game. They might get some info on you, but the data brokers in the west will sell a lot more details about you, pre-categorized and all.

If you're afraid of industrial espionage, Chinese companies may be a risk, but in that case you shouldn't be uploading your secrets to an AI company in the first place.


If you have no need for Anthropic/OpenAI's frontier model capability, you may be better served with an open-weight model that can't be taken away.

Edit:

> GPT-5 does the job.

I bring up DeepSeek V4 Flash a lot on HN, but I want to mention that according to Artificial Analysis, it trades blows with GPT-5 (high) (from August, 2025) [0]

[0]: https://artificialanalysis.ai/models/comparisons/deepseek-v4...


Piracy is justified especially when it comes to movies!

If I am buying a DVD, I own that copy regardless of the studio and the distributor being in legal trouble or not. If I "buy" or "purchase" something online, I expect the same thing.

I'm not always a fan of the EU over-regulating some things but I feel like they should start fining companies who want to re-define the meaning of the word purchase


We’ve seen more examples recently. TikTok, wireless routers, polestar cars…

> Regulatory agencies limit uses of other products without acts of congress-- cigarettes, vapes, drugs, pesticides, chemicals, explosives.

Every one of those is by a regulatory agency that was explicitly empowered by Congress to do such regulation.


For comparison, openrouter says opus 4.8 is ~55 tokens/s and fast mode is ~102.

750 tokens/s for their largest model is going to be nuts


All: for comments on the technical side please go to the related thread:

Previewing GPT‑5.6 Sol: a next-generation model - https://news.ycombinator.com/item?id=48689028


Oh even if your org has a subscription, the fees are insane. You just don't see them.

Things are slowly changing but I can't wait for this parasitic business model to collapse for good.


Im not worried about this at all. The OpenAI, Anthropic and the US government can play this game all they want... They're just accelerating the development of open source models; and helping destroy the lead the US has built in AI, and their profit margins along with it.

This is like the battle between PostgreSQL and Oracle all over. Move up market, isolate yourself to enterprises, and watch while everyone else builds on PostgreSQL and erodes any technical advantage you had, until people just stop talking about you altogether.


it should not be legal for the product page to say “purchase” or “buy” when in reality you’re only renting it with a to be determined end date

GPT-5.6 Sol’s detected cheating rate was higher than any public model we have evaluated on our ReAct agent harness. For our task suite, we define “cheating” as behavior where the model improves evaluation performance by exploiting bugs in the evaluation environment or by adopting strategies disallowed by the task, rather than solving the task within the expected evaluation constraints.

https://metr.org/blog/2026-06-26-gpt-5-6-sol/


I see it too, but worth noting that this is basically unprecedented at least within the last 25 years; I think you have to go back to export controlled cryptography for another example of this kind of abrupt and targeted regulation.

Imposing a licensing system on models for limiting domestic use should require an act of congress but I mean obviously we're well past that red line.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: