An AMD Ryzen AI Max+ 395 - I use the one from frame.work (https://frame.work/de/en/desktop) with 128GB of unified RAM, and it runs a 120b model (gpt-oss:120b) just fine.
While you might be able to continuously update the model, can you continuously update its moderation? As the article says, tuning and filtering take time; if you allow any content in without some filtering of the outputs, you might end up with another Tay. Liability alone would slow down the ability to simply update on the fly.
Also, if the proportion of available training data is larger for more established frameworks, then the model's ability to answer usefully is necessarily dictated by the volume of content, which is biased towards older frameworks.
It might be possible with live updating to get something about NewLibX, but it would probably be a less useful answer than one about 10YearOldLibY.
Moderation is the real reason it will be difficult to have online learning models in production. I think the technical side of how to do it will not be the biggest issue. The biggest one will be liability for the output.
I read this yeaaaars ago. I'm about to re-read this, but before I do, I think this was the article that installed a little goblin in my brain that screams "TTS" in instances like this. I will edit this if the article confirms/denies this goblin.
It takes too long to iterate on a character design.
For more explanation: I've been playing around with stable diffusion on my laptop recently; I have an RTX 4070 with 8GB of dedicated VRAM, so it's not nothing.
The main problem I have is that it takes a lot of iteration on a prompt, at lower resolution and fewer sampling steps, before I know I'll get roughly what I want.
I tried making a character in Eggnog, and before I could be sure what I was getting, it told me it'd take 15-20 minutes to be ready. I worry that this will just make me wait a long time for a character that isn't what I want, and starting again too many times will put me off.
The iteration and feedback loop needs to be tighter in my opinion, or people will get unsatisfactory results and be unwilling to go back and fine tune.
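The tighter loop I have in mind looks something like this (a toy sketch: `render()` is a hypothetical stand-in for a real diffusion pipeline call, and the step/resolution numbers are just examples, not Eggnog's actual settings):

```python
# Cheap low-step previews with a fixed seed, then one expensive final render
# once the prompt looks right. Keeping the seed constant means the draft is a
# reasonable predictor of the final composition.

calls = []  # record of every render request, for illustration

def render(prompt, steps, width, height, seed):
    """Hypothetical stand-in for a diffusion pipeline invocation."""
    calls.append({"prompt": prompt, "steps": steps,
                  "width": width, "height": height, "seed": seed})
    return f"image(steps={steps}, {width}x{height}, seed={seed})"

def preview(prompt, seed):
    # Fast draft: low resolution, few sampling steps.
    return render(prompt, steps=12, width=512, height=512, seed=seed)

def final(prompt, seed):
    # Expensive pass only after the draft looks right; same seed so the
    # result stays close to what the preview showed.
    return render(prompt, steps=40, width=1024, height=1024, seed=seed)

seed = 1234
draft = preview("red-haired knight, painterly style", seed)
# ...inspect the draft, tweak the prompt, repeat as needed...
image = final("red-haired knight, painterly style, detailed armor", seed)
```

The point is that the user only ever waits for the expensive pass once, after cheap drafts have confirmed the direction.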
Thanks, this is helpful feedback. We're definitely frustrated with how long it takes to load a character. We'll see what we can do to give a better sense of what the character will look like before the training job kicks off. We should be able to show some intermediate results.
I think you're talking about spinning up a temporary environment running the code and connecting via a local IDE to inspect it, whereas OP is talking about hosting the IDE remotely.
At Google, people can use "Cider", a browser-based IDE, or a "Cloudtop", a desktop virtual machine provisioned via Google's cloud infrastructure, as alternatives to a dedicated physical workstation.
I use InfluxDB for this; it comes with a frontend UI, and you can configure Telegraf as a statsd listener, so it's pretty much the same metric ingestion as Datadog. There are Docker containers for both, which I've added to my docker-compose for local dev.
I think it does log ingestion too, but I've never used that; I mostly use it for metrics and graphing.
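For reference, a minimal sketch of that setup (image tags, ports, and credentials here are illustrative placeholders; check the current InfluxDB and Telegraf docs before using):

```yaml
# docker-compose.yml (local dev only; credentials are throwaway examples)
services:
  influxdb:
    image: influxdb:2.7
    ports:
      - "8086:8086"   # UI + API
    environment:
      DOCKER_INFLUXDB_INIT_MODE: setup
      DOCKER_INFLUXDB_INIT_USERNAME: dev
      DOCKER_INFLUXDB_INIT_PASSWORD: devpassword
      DOCKER_INFLUXDB_INIT_ORG: dev-org
      DOCKER_INFLUXDB_INIT_BUCKET: metrics
      DOCKER_INFLUXDB_INIT_ADMIN_TOKEN: dev-token

  telegraf:
    image: telegraf:1.30
    ports:
      - "8125:8125/udp"  # statsd listener, same default port as Datadog's agent
    volumes:
      - ./telegraf.conf:/etc/telegraf/telegraf.conf:ro
    depends_on:
      - influxdb
```

with a telegraf.conf along these lines:

```toml
# Accept statsd metrics on UDP 8125 and forward them to InfluxDB.
[[inputs.statsd]]
  service_address = ":8125"

[[outputs.influxdb_v2]]
  urls = ["http://influxdb:8086"]
  token = "dev-token"
  organization = "dev-org"
  bucket = "metrics"
```

Your app then points its statsd client at localhost:8125, same as it would for the Datadog agent.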
See Wendell's review here - https://www.youtube.com/watch?v=L-xgMQ-7lW0
There are other mini-PC manufacturers; the mainboard is the important part.