Cloudflare Containers (and therefore Sandbox) pricing is way too expensive. The pricing is also cumbersome to understand: it is inconsistent with other Cloudflare products in terms of units, and it is split between memory, CPU, and disk instead of combined per instance. Worst of all, it is quoted in tiny fractions per second:
Memory: $0.0000025 per additional GiB-second
vCPU: $0.000020 per additional vCPU-second
Disk: $0.00000007 per additional GB-second
The smaller instance types have very low processing power, getting only a fraction of a vCPU. But if you calculate the monthly cost, it comes to:
Memory: $6.48 per GB
vCPU: $51.84 per vCPU (!!!)
Disk: $0.18 per GB
These prices exceed the already expensive prices of the big cloud providers. For example, a t2d-standard-2 on GCP with 2 vCPUs, 8 GB of memory, and 16 GB of storage would cost $63.28 per month, while the comparable standard-3 instance on CF would cost a whopping $51.84 + $103.68 + $2.90 = $158.42, about 2.5x the price.
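As a sanity check, here is the arithmetic behind those numbers as a rough Python sketch, assuming a 30-day month and the standard-3 shape implied by the comparison above (2 vCPU, 8 GiB memory, 16 GB disk):

    SECONDS_PER_MONTH = 30 * 24 * 3600  # 2,592,000 seconds in a 30-day month

    mem_gib_month = 0.0000025 * SECONDS_PER_MONTH   # -> $6.48 per GiB-month
    vcpu_month    = 0.000020  * SECONDS_PER_MONTH   # -> $51.84 per vCPU-month
    disk_gb_month = 0.00000007 * SECONDS_PER_MONTH  # -> ~$0.18 per GB-month

    # Assumed standard-3 shape: 2 vCPU, 8 GiB memory, 16 GB disk
    total = 8 * mem_gib_month + 2 * vcpu_month + 16 * disk_gb_month
    print(f"${total:.2f}/month")  # ~$158.42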
Cloudflare Containers also don't have persistent storage and are by design intended to shut down when not used. But then I could just as well go for a spot VM on GCP, which would bring the price down to $9.27, less than 6% of the CF container cost, and I would get persistent storage plus a ton of other features on top.
You can't compare these with regular VMs on AWS or GCP. These sandboxes are expected to boot up in milliseconds and can be stopped/killed in milliseconds, and you are charged per second of usage. They are ephemeral and meant for AI coding agents; a typical sandbox session runs less than 30 minutes. The premium is for the flexibility that comes with this.
I think you can absolutely compare them, and there is no added flexibility; in fact, there is less flexibility. There is added convenience, though.
Given that huge factor in price, you can keep spare spot VMs on GCP idle and warm all the time and still be an order of magnitude cheaper. You get more features and flexibility with these. You can also discard them at will; they are not charged per month. Pricing granularity on GCP is per second (with a 1-minute minimum), and you can fire up Firecracker VMs within milliseconds, as another commenter pointed out.
Cloudflare Sandboxes have less functionality at a significantly higher price. The tradeoff is simplicity: they are focused on a specific use case and need no additional configuration or tooling for it. The downside is that they can't do everything a proper VM can.
It's a fair tradeoff, but I'd argue the price difference is very much out of balance. Then again, this seems to be a feature primarily going after AI companies, and there is infinite VC money to burn at the moment.
I could easily spin up a Firecracker VM on demand and put it behind an API. It boots up in under 200 milliseconds, and I get to control it however I wish. And all costs are under my control.
In my case, it is ignorance. I am not familiar with how to wield Firecracker VMs and manage their lifecycle without putting a hole in my pocket. These sandbox services (e2b, Daytona, Vercel, etc.) package them in an intuitive SDK for me to consume in my application. Since sandboxing is not the main differentiator for me, I am okay with leveraging the external providers to fill in for me.
That said, I would be grateful if you could point me to the right resources on how to do this myself :)
This is a pretty good use-case for an open-source project then.
As for a guide, just follow their official docs. I went through them again today, literally copy-pasting shell commands one after the other, and voila: I had a Firecracker VM running and booting a full-fledged Ubuntu VM.
It was so damn fast that when it started, I thought my terminal had crashed because its prompt changed. But nope. It was just that fast: even while literally looking at it, I couldn't catch the moment it actually booted up.
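For anyone curious what "putting it behind an API" amounts to: Firecracker itself exposes an HTTP API over a Unix socket, and booting a VM is essentially three PUT requests. Here is a minimal Python sketch; the socket, kernel, and rootfs paths are placeholders taken from the getting-started guide, and error handling is omitted:

    import json, socket

    API_SOCK = "/tmp/firecracker.socket"  # path passed via firecracker --api-sock

    def api_put(path, body):
        # Fire a raw HTTP PUT at Firecracker's Unix-socket API.
        payload = json.dumps(body).encode()
        req = (f"PUT {path} HTTP/1.1\r\nHost: localhost\r\n"
               f"Content-Type: application/json\r\n"
               f"Content-Length: {len(payload)}\r\n\r\n").encode() + payload
        with socket.socket(socket.AF_UNIX, socket.SOCK_STREAM) as s:
            s.connect(API_SOCK)
            s.sendall(req)
            return s.recv(4096).decode()

    api_put("/boot-source", {"kernel_image_path": "vmlinux",
                             "boot_args": "console=ttyS0 reboot=k panic=1 pci=off"})
    api_put("/drives/rootfs", {"drive_id": "rootfs", "path_on_host": "rootfs.ext4",
                               "is_root_device": True, "is_read_only": False})
    api_put("/actions", {"action_type": "InstanceStart"})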
By the way, two open-source projects already exist:
Cloudflare Containers feel a lot pricier compared to Workers, but I think they could provide a more streamlined experience. Still, if we're talking about a complete cost analysis, I sometimes wonder how CF Containers vs. Workers vs. Hetzner/dedicated/shared VPS/GCP etc. would work out for the same thing.
Honestly, the more I think about it, for my own sanity I want to use Hetzner or similar for Golang and other binary-related stuff, and CF Workers with SvelteKit for the frontend.
That way we could have the best of both worlds, probably gluing some things together using protobuf or something. I guess people don't like managing two codebases, but SvelteKit is a pleasure to work with and can easily be learnt by anybody in 3-4 weeks, maybe a bit more for Golang. I might look more into CF Containers/GCP or whatever, but my heart wants Hetzner for the backend with Golang if need be, while trying to extract as much juice as I can from CF Workers with SvelteKit in the meanwhile.
Isn't that a bit of a holy grail though? If your software can fact check the output of LLMs and prevent hallucinations then why not use that as the AI to get the answers in the first place?
Because you - hopefully - have a check against something that is, on average, of higher quality than the combined input of an LLM.
I'm not sure whether this can work, but it would be nice to see a trial. You could probably do this by hand if you wanted to: break up the answer from an LLM into factoids, check each of those individually, and assign each a score based on the amount of supporting evidence. I'd love that as a plug-in to a browser too.
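A toy sketch of that by-hand process in Python, with the factoid extraction and evidence lookup as deliberately naive stand-ins (in practice those would be an LLM prompt and a search/retrieval API):

    def extract_factoids(answer: str) -> list[str]:
        # Naive stand-in: treat every sentence as one checkable claim.
        return [s.strip() for s in answer.split(".") if s.strip()]

    def evidence_count(claim: str, corpus: list[str]) -> int:
        # Toy scoring: count reference documents sharing >= 3 keywords with the claim.
        words = set(claim.lower().split())
        return sum(1 for doc in corpus if len(words & set(doc.lower().split())) >= 3)

    def score_answer(answer: str, corpus: list[str]) -> list[tuple[str, int]]:
        # One (factoid, supporting-evidence score) pair per claim in the answer.
        return [(c, evidence_count(c, corpus)) for c in extract_factoids(answer)]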
I don't think they are a hint that spacetime is not fundamental. But I do think spacetime has to be some kind of real physical reality.
The modifications of spacetime that we see as effects of gravity are relative changes to our immediate surroundings or reference frame.
Similar to how you can't tell which of two objects in freefall is actually stationary and which is moving, since all you can note is the relative speed between the two, it would be equally valid to say the objects inside spacetime are getting distorted relative to spacetime.
Even on my MacBook with Firefox, the site has a strange feel when scrolling. It's not exactly struggling, but it feels unnatural and slightly off/slow/uneven, like it's on the edge of struggling. It's a bit hard to describe. The effect gets worse towards the mid-section of the page with the side-scrolling logo circles. I removed that section via dev tools, which helped with performance. With that part of the page in view, I get 80-90% CPU usage on one core. But even after removing it, I can saturate a core by scrolling around, especially towards the lower part of the page.
It is indeed some of the worst-optimized CSS I've seen in a while. Weird for a project that is all about speed.
If every site did that, it would be harder to quickly spot one in a long list of tabs. A neat trick, but I don't think it is a particularly good idea.
It doesn’t say the paper was the entity engaged in the raid. If I didn’t know the broader context, I would assume that sentence meant “A newspaper was investigating a police chief at the time the police chief’s home was raided by another law enforcement agency”. That seems way more likely than a newspaper being referred to as “him”.
There is some meat to the story, I agree. But it's not surprising. The fine-tuning model will of course be small in file size and not take long to train, because by definition it applies changes to a small subset of the main model and is trained on only a small amount of input data. You can't use the small tuning model for "Teddies" with a query that has nothing to do with Teddies. You could see these small tuning models as a diff file for the main model: depending on the user query, one can choose an appropriate diff to apply to improve the result for that specific query.
When you train a model on new inputs to fine-tune it, you can save the weights that changed to a separate file instead of the main file.
In other words, one can see the small tuning models as selectively applied updates/patches.
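A rough numpy illustration of that "diff" view, along the lines of LoRA-style adapters: only a small low-rank update is trained and saved, and it gets applied on top of the frozen base weights at query time. All names and shapes here are made up for the sketch:

    import numpy as np

    base = np.random.randn(4096, 4096).astype(np.float32)  # one frozen base-model layer

    # The fine-tune only trains a low-rank update, so the saved "diff" is tiny:
    # 2 * 4096 * 8 values instead of 4096 * 4096.
    A = np.random.randn(4096, 8).astype(np.float32) * 0.01
    B = np.random.randn(8, 4096).astype(np.float32) * 0.01
    np.savez("teddies_adapter.npz", A=A, B=B)  # hypothetical "Teddies" tuning file

    # At query time, pick the adapter that fits the query and patch the layer.
    adapter = np.load("teddies_adapter.npz")
    patched = base + adapter["A"] @ adapter["B"]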
One difference is that you are aware that you can't do it and say so. Our current LLMs will just give whatever result they think it should be. It might be correct, it might be off by a bit, or it might be completely wrong, and there is no way for the user to tell without double-checking against some non-LLM source, which kind of defeats the purpose of asking the LLM in the first place.
Q1 and Q2 of 2022 showed negative growth (Q2 only barely). The past four quarters have not. So technically there was a brief recession in H1 of 2022, but right now there is no clear sign of a recession per the definition.
I think recessions are also widely misunderstood as being a binary thing, like going from "everything is A-OK" to "OMG it's all going to shite". There can be a recession which people barely feel. It's not like an event horizon from which there is no turning back.