Hacker News: booi's comments

Tabs can absolutely be entered into cells in multiple ways but the easiest is just copy paste.

And if it’s tab-delimited, people usually call them TSVs.
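As an illustration (a minimal sketch using Python's standard csv module, not anything from the article), tab-delimited data is read by passing delimiter="\t", and a tab pasted *inside* a cell only survives if the field is quoted:

```python
import csv
import io

# A tab-delimited file where one quoted field itself contains a tab.
data = 'name\tnote\r\n"Ann"\t"has\ta tab inside"\r\n'
rows = list(csv.reader(io.StringIO(data), delimiter="\t"))
print(rows[0])  # ['name', 'note']
print(rows[1])  # ['Ann', 'has\ta tab inside']
```

Without the quoting, the embedded tab would split the cell into two columns.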


Clearly this is the issue. This article was 2000 words of trying to work around the actual problem.

It's a pretty niche region. It's about 20% more expensive than other major regions so I think it's unlikely to be a core part of most large deployments. That being said, we did hear about it and continue to track it.

the EU (where I am!) is not a niche region and was also taken down

Perhaps I don't understand it, but the encryption security model for macOS/iPadOS/iOS currently doesn't allow a different encryption key per user. So any user can decrypt the whole drive, and while the OS does enforce user permissions, the security model can't support true multiuser.

I actually don't know if Windows or ChromeOS support this either, but this is certainly something Linux can do with LUKS et al.


Yep, on ChromeOS each user's home dir is separately encrypted with their own password.
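The per-user model being described can be sketched in a few lines (a conceptual toy, not ChromeOS's actual scheme; names and KDF parameters are illustrative): each user's home-directory key is derived from that user's own password, so one user's credentials reveal nothing about another's key.

```python
import hashlib
import os

def home_dir_key(password: bytes, salt: bytes) -> bytes:
    # Derive a per-user encryption key from that user's password alone.
    return hashlib.scrypt(password, salt=salt, n=2**14, r=8, p=1, dklen=32)

alice = home_dir_key(b"alice-password", os.urandom(16))
bob = home_dir_key(b"bob-password", os.urandom(16))
# Independent keys: decrypting Alice's data gets you nowhere with Bob's.
assert alice != bob
```

Contrast this with a single volume key: there, any user who can unlock the disk can read everyone's data, and only filesystem permissions stand in the way.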

arguably if you're using the CLI they still are

Yeah but now they're using npm to install a million packages to do things like tell if a number is greater than 10000. The chances of the programmer wanting to understand the underlying system they are using is essentially nil.

Yea, they are just much higher-level programmers… most programmers don’t know the low-level syscall APIs.

nah, we have long had other disciplines using the CLI who do not write their own software, e.g. sysadmins

That’s only 100% tho


It still blows my mind that they called that crappy, partial, buggy character set “utf8”. Then later came out with actual utf8 and called it “utf8mb4”. Makes no sense


They should have addressed it much earlier, but it makes way more sense in historical context: when MySQL added utf8 support in early 2003, the UTF-8 standard still permitted up to 6 bytes per character. This had excessive storage implications, and emoji weren't in widespread use at all. 3 bytes were sufficient to store the majority of characters in use at the time, so that's what they went with.

And once they made that choice, there was no easy fix that was also backwards-compatible. MySQL avoids breaking binary data compatibility across upgrades: aside from a few special cases like fractional time support, an upgrade doesn't require rebuilding any of your tables.
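The byte math behind the 3-vs-4 split is easy to check (a quick Python sketch; MySQL names are as described above): everything in the Basic Multilingual Plane encodes to at most 3 bytes in UTF-8, while emoji and other supplementary-plane characters take 4, which the old 3-byte "utf8" (now utf8mb3) cannot store.

```python
# UTF-8 length by code point range:
print(len("A".encode("utf-8")))   # 1 byte  (ASCII)
print(len("é".encode("utf-8")))   # 2 bytes (Latin-1 supplement)
print(len("中".encode("utf-8")))  # 3 bytes (still within the BMP)
print(len("😀".encode("utf-8")))  # 4 bytes (outside the BMP: needs utf8mb4)
```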


Your explanation makes it sound like an incredibly stupid decision. I imagine what you're getting at is that 3 bytes were/are sufficient for the Basic Multilingual Plane, which is incidentally also what can be represented in a single utf-16 code unit. So they imposed the same limitation as utf-16 had on utf-8. This would have seemed logical in a world where utf-16 was the default and utf-8 was some annoying exception they had to get out of the way.
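That parallel with UTF-16 can be checked directly (a small sketch; utf-16-le is used to avoid the 2-byte BOM that plain "utf-16" prepends): BMP characters fit in a single 16-bit code unit, while anything beyond needs a surrogate pair.

```python
# Number of 16-bit code units per character in UTF-16:
print(len("中".encode("utf-16-le")) // 2)   # 1 code unit: BMP fits in 16 bits
print(len("😀".encode("utf-16-le")) // 2)  # 2 code units: a surrogate pair
```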


OK, but that makes perfect sense given utf-16 was actually quite widespread in 2003! For example, Windows APIs, MS SQL Server, JavaScript (off the top of my head)... these all still primarily use utf-16 today even. And MySQL also supports utf-16 among many other charsets.

There wasn't a clear winner in utf-8 at the time, especially given its 6-byte-max representation back then. Memory and storage were a lot more limited.

And yes while 6 bytes was the maximum, a bunch of critical paths (e.g. sorting logic) in old MySQL required allocating a worst-case buffer size, so this would have been prohibitively expensive.


This still makes no sense. The UTF-8 standard was adopted really in 1998-ish and the standard was already variable using 1 to 4 bytes. MySQL 4.1, which introduced the utf8 charset, was released in 2004.

Even if there were no codepoints in the 4-byte range yet, they could and should have implemented it anyway. It literally does not take any more storage because it is a variable width encoding.


> The UTF-8 standard was adopted really in 1998-ish and the standard was already variable using 1 to 4 bytes.

No, it was 1 to 6 bytes until RFC 3629 (Nov 2003). AFAIK development of MySQL 4.1 began prior to that, despite the release not happening until afterwards.

Again, they absolutely should have addressed it sooner. But people make mistakes, especially as we're talking about a venture-funded startup in the years right after the dot-com crash.

> It literally does not take any more storage because it is a variable width encoding.

I already addressed that in my previous comment: in old versions of MySQL, a number of critical code paths required allocating worst-case buffer sizes, or accounting for worst-case value lengths in indexes, etc. So if a charset allows 6 bytes per character, that means multiplying max length by 6, in order to handle the pathological case.
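One concrete instance of that worst-case arithmetic is InnoDB's old 767-byte index-prefix limit (a real limit from that era; the loop below is just illustrative arithmetic): index sizing must assume every character hits the charset's maximum width.

```python
INDEX_PREFIX_LIMIT = 767  # old InnoDB default, before MySQL 5.7 large prefixes

for bytes_per_char in (3, 4, 6):
    max_chars = INDEX_PREFIX_LIMIT // bytes_per_char
    print(bytes_per_char, max_chars)
# 3 bytes/char -> 255 chars (why a utf8mb3 VARCHAR(255) was fully indexable)
# 4 bytes/char -> 191 chars (the origin of utf8mb4 VARCHAR(191) columns)
# 6 bytes/char -> 127 chars (the pathological case a 6-byte charset forces)
```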


Where would this happen? I have never seen an API reflect a secret back, but I guess it's possible? Perhaps some sort of token-creation endpoint?


How does the API know that it's a secret, though? That's what's not clear to me from the blog post. Can I e.g. create a customer named PLACEHOLDER and get a customer actually named SECRET?


This blog post is very clearly AI generated, so I’m not sure it knows either.


The point is that without semantic knowledge, there's no way of knowing whether the API actually considers it a secret. If you're using the GitHub API and have it listed as an approved host, but the sandbox doesn't predefine which fields may legitimately contain the token, a malicious application could put the placeholder in the body of an API request that creates a public gist or something, and the placeholder then gets replaced with the actual secret. To avoid this, the sandbox would need some way of enforcing which fields in the API itself are safe. For a widely used API like GitHub's, this might be built in, but to support arbitrary APIs, there would probably have to be some way of manually configuring the list of fields that are considered safe.

From various other comments in this thread, though, it sounds like this is already well-established territory that past tools have explored. It's not clear to me how much of this is actually implemented for Deno Sandboxes, but I'd hope they took into account the prior art that has already come up with techniques for handling very similar issues.
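The bug class being described reduces to a few lines (all names here are hypothetical, not Deno's or GitHub's actual API): if the proxy blindly substitutes the placeholder anywhere in an outbound request, an attacker-chosen body for a *public* endpoint exfiltrates the real secret.

```python
SECRET = "ghp_realtoken"          # hypothetical real credential
PLACEHOLDER = "{{GITHUB_TOKEN}}"  # what sandboxed code sees instead

def proxy_rewrite(body: str) -> str:
    # Naive sandbox proxy: substitute the placeholder wherever it appears.
    return body.replace(PLACEHOLDER, SECRET)

# Attacker-controlled request body that creates a public gist:
malicious_body = '{"public": true, "files": {"x": {"content": "{{GITHUB_TOKEN}}"}}}'
sent = proxy_rewrite(malicious_body)
# The real token now lands in publicly readable content.
assert SECRET in sent
```

The fix sketched in the thread is an allowlist: only substitute into fields known to be safe (e.g. the Authorization header), never into arbitrary request bodies.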


Say, an endpoint tries to be helpful and responds with “no such user: foo” instead of “no such user”. Or, as a sibling comment suggests, any create-with-properties or set-property endpoint paired with a get-property one also means game over.

Relatedly, a common exploitation target for black-hat SEO and even XSS is search pages that echo back the user’s search request.


It depends on where you allow the substitution to occur in the request. It's basically "the big bug class" you have to watch out for in this design.


This is effectively what happened with the BotGhost vulnerability a few months back:

https://news.ycombinator.com/item?id=44359619


HTTP Header Injection or HTTP Response Splitting is a thing.
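A minimal illustration of that (hypothetical host and values): if a substituted or user-supplied value can contain CR/LF, it smuggles extra headers into the raw request.

```python
def build_request(path: str, token: str) -> str:
    # Naive request builder that trusts the header value verbatim.
    return (
        f"GET {path} HTTP/1.1\r\n"
        f"Host: api.example.com\r\n"
        f"Authorization: {token}\r\n"
        f"\r\n"
    )

# A value containing CRLF injects an attacker-chosen header:
raw = build_request("/v1/me", "abc\r\nX-Injected: evil")
assert "\r\nX-Injected: evil\r\n" in raw
```

Real HTTP clients reject CR/LF in header values for exactly this reason; a placeholder-substituting proxy has to do the same.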


Copilot being down probably increased code quality


that's the best part! They don't!

