
If your threat model includes the TLA types, then back up to a physical server you control in a location geographically isolated from your main location. Or to a local set of drives that you physically rotate to remote locations.


Decryption is not usually an issue if you encrypt locally.

Tools like Kopia, Borg and Restic handle this and also include deduplication and other advanced features.

There's really no excuse for large orgs, or even for small businesses and the somewhat tech-literate public.
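
A rough sketch of that workflow, driving restic from a small Deno script (the repo location, paths, and password file are placeholders, not a specific recommendation). restic encrypts client-side, so the key never leaves your machine:

    // Push an encrypted, deduplicated snapshot to an off-site repo you control.
    // Run `restic -r <repo> init` once beforehand to create the repository.
    const backup = new Deno.Command("restic", {
      args: [
        "-r", "sftp:backup@offsite.example.com:/srv/restic-repo", // hypothetical remote
        "backup", "/home/me/important-data",
      ],
      env: { RESTIC_PASSWORD_FILE: "/home/me/.restic-pass" }, // local encryption key
    });
    const { code } = await backup.output();
    if (code !== 0) console.error("backup failed with exit code", code);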


Thanks for posting. Love to see Rust as a strategic direction for MS, and how they're using it in the core OS, Azure, and security areas, with much of it open source.

I still use https://sysinternals.com (though not via their live channel).


The performance improvements are impressive:

> In Automerge 3.0, we've rearchitected the library so that it also uses the compressed representation at runtime. This has achieved huge memory savings. For example, pasting Moby Dick into an Automerge 2 document consumes 700Mb of memory, in Automerge 3 it only consumes 1.3Mb!

> Finally, for documents with large histories load times can be much much faster (we recently had an example of a document which hadn't loaded after 17 hours loading in 9 seconds!).


I wonder if this is accomplished using controlled buffers in AsyncIterators. I recently built a tool for processing massive CSV files and was able to get the memory usage remarkably low, and control/scale it almost linearly because of how the workers (async iterators) are spawned and their workloads are managed. It kind of blew me away that I could get such fine-tuned control that I'd normally expect from Go or Rust (I'm using Deno for this project).

I'm well above 1.3mb, and although I could get it down there, performance would suffer. I'm curious how fast they sync this data with such tiny memory usage. If the resources were available before, despite using 700mb of memory, was it still faster?

These people are definitely smarter than I am, so maybe their solution is a lot more clever than what I'm doing.

edit: Oh, they did this part with Rust. I thought it was written in JS. I still wonder: how'd they get memory usage this low, and did it impact speed much? I'll have to dig into it
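
For reference, a minimal sketch of the bounded-memory pattern I mean (Deno/TypeScript; the file path, handleRow, and the concurrency limit are placeholders, not anyone's actual code):

    import { TextLineStream } from "jsr:@std/streams/text-line-stream";

    // Lazily yield CSV rows; nothing beyond the current chunk is held in memory.
    async function* rows(path: string): AsyncGenerator<string> {
      const file = await Deno.open(path, { read: true });
      const lines = file.readable
        .pipeThrough(new TextDecoderStream())
        .pipeThrough(new TextLineStream());
      for await (const line of lines) yield line; // pull-based: backpressure for free
    }

    // Cap how many rows are processed concurrently, so memory scales with
    // `limit` rather than with file size.
    async function run(path: string, limit = 8) {
      const inFlight = new Set<Promise<void>>();
      for await (const row of rows(path)) {
        const task = handleRow(row).finally(() => inFlight.delete(task));
        inFlight.add(task);
        if (inFlight.size >= limit) await Promise.race(inFlight);
      }
      await Promise.all(inFlight);
    }

    // Placeholder for per-row work (parse, validate, write somewhere).
    async function handleRow(_row: string): Promise<void> {}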


They say: "In Automerge 3.0, we've rearchitected the library so that it also uses the compressed representation at runtime. This has achieved huge memory savings."


Right, this didn't click at first but now I understand. I can actually gain similar benefits with my project by switching to storing the data as parquet/duckdb files; I had no idea the potential gains from compressed representations are so significant, so I'd been holding off on testing that out. Thanks for the nudge on that detail!
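
A toy illustration of why the compact, column-oriented representation matters (plain TypeScript, made-up numbers, not a benchmark of Automerge or DuckDB):

    // Row-oriented: a million tiny objects, each with its own header,
    // property map, and boxed fields.
    const N = 1_000_000;
    const objects = Array.from({ length: N }, (_, i) => ({ id: i, temp: 20 + (i % 10) }));

    // Column-oriented: two flat typed arrays, ~4 bytes per value, and easy to
    // compress further because similar values sit next to each other.
    const ids = new Uint32Array(N);
    const temps = new Float32Array(N);
    for (let i = 0; i < N; i++) {
      ids[i] = i;
      temps[i] = 20 + (i % 10);
    }
    console.log(objects.length, "rows vs", ids.byteLength + temps.byteLength, "bytes of columns");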


> I recently built a tool for processing massive CSV files and was able to get the memory usage remarkably low

is it OSS? i'd like to benchmark it against my csv parser :)


No, it's very specific to some watershed sensing data that comes from a bunch of devices strewn about the coast of British Columbia. I'd love to make it (and most of the work I do) OSS if only to share with other scientific groups doing similar work.

Your parser is almost certainly better and faster :) Mine is tailored to a certain schema with specific expectations about foreign keys (well, the concept and artificial enforcement of them) across the documents. This is actually why I've been thinking about using duckdb for this project; it'll allow me to pack the data into the db under multiple schemas with real keys and some primitive type-level constraints. Analysis after that would be sooo much cleaner and faster.

The parsing itself is done with the streams API and orchestrated by a state chart (XState), and while the memory management and concurrency of the whole system is really nice and I'm happy with it, I'm probably making tons of mistakes and trading program efficiency for developer comforts here and there.

The state chart essentially does some grouping operations to pull event data from multiple CSVs; once it has those events, it stitches them together into smaller portions and ensures the tables map to one another by the event's ID. It's nice because grouping occurs from one enormous file, and it carves out these groups for the state chart to then organize, validate, and store in parallel. You can configure how much it'll do in parallel, but only because we've got some funny practices here and it's a safety precaution to prevent tying up too many resources on a massive kitchen-sink server on AWS. Haha. So, lots of non-parsing-specific design considerations are baked in.
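
Roughly, the chart has this shape (a hand-wavy XState sketch; the state names, events, and npm specifier are illustrative, not the real machine):

    import { createMachine, createActor } from "npm:xstate";

    const ingest = createMachine({
      id: "csvIngest",
      initial: "grouping",
      states: {
        grouping:   { on: { GROUP_READY: "stitching" } },   // carve a group out of the big CSV
        stitching:  { on: { BATCH_READY: "validating" } },  // join tables by event ID
        validating: { on: { VALID: "storing", INVALID: "grouping" } },
        storing:    { on: { STORED: "grouping", ALL_DONE: "done" } },
        done:       { type: "final" },
      },
    });

    const actor = createActor(ingest);
    actor.subscribe((snapshot) => console.log("state:", snapshot.value));
    actor.start();
    actor.send({ type: "GROUP_READY" });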

One day I'll shift this off the giga-server and let it run in isolation with whatever resources it needs, but for now it's baby steps and compromises.


thanks!


- single binary file deployment

- TUI based configuration

- API endpoints


See https://archive.ph/oXYXe for more info about the TeleMessage version of Signal approved for use by government offices.


Are "paid for" and "properly approved for classified information" being conflated here? I may have missed something.


If you don’t mind supervised mode, this can help prevent this bypass: https://www.techlockdown.com/blog/prevent-turning-off-wifi-i...


Congrats on publishing!

It seems like a very polished and better integrated version of https://www.when2meet.com/.

You say you do not collect info. Are you saving the meeting details and availability in a database?


there is no external server! all meeting information is stored as metadata on the message.

this leads to some issues with potential collisions: if two people click the Whenish message at the same time and submit their responses, there is no way to merge both sets of data. while this is an issue, i wanted to err on the side of privacy as much as possible and not rely on a server at all.
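
a tiny illustration of the collision (plain TypeScript, made-up metadata shape): both clients start from the same snapshot, each adds their own availability, and whichever submit lands last simply replaces the other.

    // Hypothetical metadata blob attached to the message.
    type Availability = Record<string, string[]>; // name -> chosen time slots

    let metadata: Availability = {};

    // Both participants start from the same snapshot...
    const aliceView = structuredClone(metadata);
    const bobView = structuredClone(metadata);

    // ...each adds their own availability locally...
    aliceView["alice"] = ["Mon 10:00"];
    bobView["bob"] = ["Tue 14:00"];

    // ...and whoever submits last overwrites the other (last-writer-wins).
    metadata = aliceView;
    metadata = bobView;
    console.log(metadata); // { bob: ["Tue 14:00"] } -- alice's entry is gone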


From the terms of use:

To help with quality and improve our products, human reviewers may read, annotate, and process your API input and output. Google takes steps to protect your privacy as part of this process. This includes disconnecting this data from your Google Account, API key, and Cloud project before reviewers see or annotate it. Do not submit sensitive, confidential, or personal information to the Unpaid Services.

https://ai.google.dev/gemini-api/terms#data-use-unpaid


Thank you for your service!

How many from USDS are left after this group of resignations?

