More

fvdessen · 2026-02-28T15:10:14 1772291414

I think the best place to put barriers in place is at the mcp / tool layer. The email inbox mcp should have guardrails to prevent damage. Those guardrails could be fine grained permissions, but could also be an adversarial model dedicated to prevent misuse.

fvdessen · 2026-02-12T19:09:14 1770923354

maybe you're a pro vector artist but I couldn't create such a cool one myself in illustrator tbh

fvdessen · 2026-02-07T16:37:22 1770482242

That's the point of the loop, (the prompt is in another comment) start with a fresh context at every step, read the whole code base, and do one thing at a time.

Two important part that has been left out from the article is 1) service code size, our services are small enough to fit in a context + leave room for implementation of the change. If this is not the case you need to scope it down from 'read the whole service'.

The other part is that our services interact with http apis specified as openapi yaml specs, and the refactoring hopefully doesn't alter their behaviour and specifications. If it was internal apis or libraries where the spec are part of the code that would potentially be touched by the reafctoring I would be less at ease with this kind of approach

The service also have close to 100% test coverage, and this is still essential as the models still do mistakes that wouldn't be caught without them

burnerToBetOut · 2026-02-07T17:32:49 1770485569

    > …our services interact with http apis…
    > … 
    > …If it was internal apis or libraries…

That reminds me that I wanted to ask you: How good is your agent with complying with your system's architectural patterns?

Given my admittedly limited experience with coding agents, I'd expect a fully autonomous agent to have a tendency to do naïve juniory dev stuff.

Like, for example, write code that makes direct calls to your data access layer (i.e., the repository) from your controllers.

Or bypass the façade layer in favor of direct calls from your business services to external services.

FWIW: Those are Java/Spring Boot idioms. I'd have to research whether or not there are parallels in microservices implemented in Go.

fvdessen · 2026-02-07T17:58:07 1770487087

The architectural patterns are similar in go. The part of the prompt that contains the refactoring concerns that I wanted to fix are specific to this go project. You can very well add what you just explained and not only will it follow it, it will cleanup the parts when it isn't done. You don't need to fully explain the concept as it probably nows them well, just mentionning the concept you want to fix is enough.

In my experience the latest model (Opus 4.6 in this case) are perfectly able to do senior stuff. It's just that they don't do it from the get go, as they will give you the naive junior dev solution as a first draft. But then you can iterate on refactoring later on

burnerToBetOut · 2026-02-07T18:28:31 1770488911

    > …You don't need to fully explain the
    > concept as it probably nows them well…

Unsurprisingly, many would disagree [1]…

> 1 Establish a Clear Vision

…

You have experienced the world, and you want to work together with a system that has no experience in this world you live in. Every decision in your project that you don’t take and document will be taken for you by the AI…

[1] https://news.ycombinator.com/item?id=46916586

fvdessen · 2026-02-07T16:23:40 1770481420

Opus 4.6 is smart enough to run the tests without being told to do so, that's why it isn't in the prompt

burnerToBetOut · 2026-02-07T17:32:44 1770485564

Implicit knowledge like what you know about Opus 4.6 (and that I don't) is what I meant about it being "an amazing learning opportunity".

So, thanks :)

fvdessen · 2026-02-06T23:36:20 1770420980

Unfortunately not, it's private company code. But I can share the prompt I used for the refactoring:

  - 1. Read the whole code of the repository.
  - 2. Read the TASKS.md file if it exists.
      - 2.1. If it exists and is not empty, pick a refactoring task from the list. Choose the most appropriate.
          - 2.1.1. Refactor the code according to the task description.
          - 2.1.2. Commit the changes to git.
          - 2.1.3  Remove the task from TASKS.md
          - 2.1.3. You are done.
      - 2.2. If it doesn't exist or is empty:
          - 2.2.1. Identify the parts of the code that could be refactored, following the following principles
              - A class should have a single responsability
              - The dependencies of the class should be mockable and injected at class instanciation
              - Repeated code should be factored into functions
              - Files shouldn't be longer than 1.5K lines
          - 2.2.2: If using the previous insights you think there is valuable refactoring work to be done:
              - 2.2.2.1 Write a list of refactoring tasks in TASKS.md
              - 2.2.2.2: You are done.
          - 2.2.3: If there is no more refactoring to be done, notify me with 'say "I am done with refactoring"'

burnerToBetOut · 2026-02-07T00:19:32 1770423572

Awesome! Thanks again!

    > …Read the TASKS.md file if it exists…

What about sharing that TASKS.md? I'd like to replicate your success as closely as possible.

Having the same tasks would help me nail a similar successful result.

I forgot to also ask: What language and which REST framework was your microservice implemented with?

For even better reproducibility, I'm thinking I should have my codebase be as similar to yours as I can get it.

TIA!

fvdessen · 2026-02-07T00:56:31 1770425791

The TASKS.md file will be created and filled by the model. The prompt needs to be run repeatedly in a loop until it decides there's nothing to be done anymore.

The service was in go, but this doesn't matter.

fvdessen · 2026-02-06T16:20:33 1770394833

Yes, someone still has to orchestrate but it's going to be fewer people with higher level of responsibilities.

fvdessen · 2026-02-01T18:25:36 1769970336

I for one found this event really sad. It's like the OSS community has rejected the past 5 years of software and technological changes and now choses to live in a retro computing bubble.

We're in 2026, hardware is made in dark factories in shenzhen in fully automated assembly lines by the million of units. Software is written using LLMs hosted in gigantic datacenters. Millions of people are now writing their own software with vibe coding platforms from their phones

What is the FOSDEM community's answer to the real concerns that these changes pose ? Let's hand solder raspberry pis ! let's self host LLMS from 2 years ago on FreeBSD ! Look, i can run wasn linux on this risc-v cpu !

These takes are completely out of touch with reality, no wonder that nobody younger than 40 was attending the conference. The next generation is doing something else and rightly so.

benou · 2026-02-01T20:09:06 1769976546

> What is the FOSDEM community's answer to the real concerns that these changes pose ? Let's hand solder raspberry pis ! let's self host LLMS from 2 years ago on FreeBSD ! Look, i can run wasn linux on this risc-v cpu !

Maybe, just maybe, they're having fun? FOSS is not only about corporate open-source, but also genuine curiosity. Both can have their place.

> no wonder that nobody younger than 40 was attending the conference. The next generation is doing something else and rightly so.

I saw a lot of students at FOSDEM, attending, presenting and helping the at organization.

fvdessen · 2026-02-01T20:29:36 1769977776

OSS original point wasn't just to have fun among nerds, but to have a real impact on the world. Fun and curiosity is fine, but there's a line where it becomes a tech themed larp event, and FOSDEM is trending towards the later.

Corporate Open Source should have its place at FOSDEM. The linux dinosaur companies such as Redhat are still there. But what about the new ones ? What about Mistral, Odoo ? Even the 'evil' ones such as facebook, github, etc, aren't they contributing a lot of open software ? Aren't they more relevant than let's say Olimex ?

There were some students, yes but the attendence is growing old, and the chit chat is more about 'remember this and that' than 'we're building the future'

ignoramous · 2026-02-01T20:54:25 1769979265

Looking at the talk lineup of a LLM related devroom, it sounds forward looking to me: https://fosdem.org/2026/schedule/track/ai/

> but to have a real impact on the world.

I did find devrooms, stands, and main track talks on this.

> Even the 'evil' ones such as facebook, github, etc, aren't they contributing a lot of open software

Google is one of the two lead financial sponsors of FOSDEM (the other being RedHat). So clearly, there doesn't seem to be any restriction or judgement by the organizers on whether BigTech is 'evil'. I get that Jack Dorsey (of Square/Block) withdrew his main track talk the year prior, which was unfortunate.

> the chit chat is more about 'remember this and that' than 'we're building the future'

Well, for better or for worse, FOSDEM is not exclusively a tech start-up event.

fvdessen · 2026-02-01T21:35:19 1769981719

> Looking at the talk lineup of a LLM related devroom, it sound forward looking to me: https://fosdem.org/2026/schedule/track/ai/

It doesn't to me at all, it is mainly focused on self hosting llms, which is a complete deadend. It just isn't feasible to self host the useful models, the hardware requirements are just too big.

The current topic of focus around AI are: how to adapt development practice to agentic coding, agent harness, agent orchestration, mcp integrations, etc.

I guess there is some unease in the oss community to rely on large companies to run and host the models. But this isn't entirely new, we also relied on big companies to manufacture our computers. It's just the way it is.

> Well, for better or for worse, FOSDEM is not a tech start-up event.

It is weird, there are a lot of startups present, look at all the stands showcasing projects. Aren't those startups ? What I noticed is that they are usually funded by public grants rather than VCs. I am not sure why this is the case.

direwolf20 · 2026-02-02T08:04:13 1770019453

I met several people self–hosting LLMs including a man from Tenstorrent demonstrating their accelerator card.

> What I noticed is that they are usually funded by public grants rather than VCs. I am not sure why this is the case.

American VC culture derives from America's privileged position in the global financial system. It can't be replicated by a country that doesn't have a ton of money floating around looking for investments.

Second reason: nobody likes what American VC culture created, so they don't want it to be replicated. Government grant funding can make decisions on axes other than profitability.

menaerus · 2026-02-02T10:15:09 1770027309

Public grants are nice but they have couple of shortcomings and which is why they can get you only that far. They are normally low in capital, the execution is really slow (couple of months to one year), and larger grants too involve politics. The process is too formal (inflexible and too time-consuming) and also quite discriminating to individuals/small-groups who do have the big ideas but are not running the business already (I mean how can they). Proposal evaluation also has its own shortcomings - there's very little incentive for the actual experts to join the evaluation process (it's paid pennies) and generally speaking this leads to another chicken&egg problem - you're presenting something novel to the pool of people who might not have the capacity to understand the idea - neither the vision nor execution.

That said, I am not attracted to the VC culture but their process delivers the value which creates successful companies.

direwolf20 · 2026-02-02T10:19:19 1770027559

NLNET is always coming up at FOSDEM. Since they have a decent track record of issuing grants, the EU delegates them some money to use in their own less bureaucratic granting process. They call this "cascade funding". NLNET has funded a lot of random individual projects you can find on their website. Nominally, your proposal must have something to do with their goals.

This year there is more emphasis on bringing complete solutions to market. Previously they were funding much more experimentation.

menaerus · 2026-02-02T11:06:08 1770030368

> This year there is more emphasis on bringing complete solutions to market. Previously they were funding much more experimentation.

That's the step in the right direction however there's what I believe is a major issue with the NLNET scheme - there is no fastrack possibility for really great ideas with very potent market impact - you have to spend (lose) ~year to prove your idea is worthy by applying to Zero Commons or similar grant instead of just getting the 200-500k to really get the project hitting the ground.

One year is exceptionally long period in tech, and if the idea is right, you need to have all the resources to execute it - working solely on the project for the whole year for 50,000 EUR is simply not the strategy that can work out in a highly competitive (world) space.

direwolf20 · 2026-02-02T11:28:21 1770031701

How should they know your project is worth investing 500k? I heard they've got 3x8M, per year I presume, so 500k is a huge chunk of that. Everyone thinks their project is worth 500k, what makes yours different from the rest?

fvdessen · 2026-02-02T22:47:46 1770072466

> How should they know your project is worth investing 500k? I heard they've got 3x8M, per year I presume, so 500k is a huge chunk of that. Everyone thinks their project is worth 500k, what makes yours different from the rest?

Well that's the job of VCs, that's what they're expert at.

There's also another model where established industrial communities set up research centers to fund projects that might help their common problems.

menaerus · 2026-02-02T11:35:08 1770032108

Yes, many might believe that their project is worth more than it really is but in my proposition authors of the idea are not the ones who get to decide that but people from NLNET or whatever grant. What I am saying is that currently there is no such process at all and this is a foundational problem with the way how these grants are working.

direwolf20 · 2026-02-02T12:08:19 1770034099

Another question is what you need 500k for, for an unproven project?

menaerus · 2026-02-02T12:17:14 1770034634

Did you not understand any of the words I wrote?

direwolf20 · 2026-02-02T13:35:37 1770039337

I guess not. Can you specify it in more concrete terms? They're not just buying your project for an arbitrary price, or VC-investing, they're paying your living costs and hosting costs while you create a donation to the public good, that's how grants work.

They give you 500k if you have a really good reason why you need that... and why the result is worth it... and why your project is more worth it than all the other several projects, combined, they could spend the same 500k on. Most of them are one or two people's living cost for 6 months to a year or so.

menaerus · 2026-02-02T13:54:48 1770040488

I genuinely hope you're a bot. If you're not then please consider being respectful in your conversations and address the question being asked rather than moving goalposts - it is extremely annoying. If you're out of your arguments, learn to say "I don't know".

And I also do hope, if you're a human, that you're not sitting anywhere close to decision making committee be it in NLNET or any other grant program because if you do, it fits into the (terrible) narrative of software market in the EU.

direwolf20 · 2026-02-02T19:40:05 1770061205

You have repeatedly resorted to ad hominem to avoid answering questions.

menaerus · 2026-02-02T20:34:05 1770064445

Ad hominem does not apply to bots or trolls and you're one of those two. And I'm not sure what was ad hominem about my response. You're the one being ignorant here

imtringued · 2026-02-02T11:59:01 1770033541

I think the one out of touch is you. Do you really think that depending on proprietary third party services is in the spirit of open source software? You didn't just say people should depend on proprietary models, whose output could at least be considered open source, you're talking about "vibe coding platforms" like lovable, which contain an unavoidable proprietary infrastructure component.

You're also engaging in historical revisionism. "5 years ago" means you're expecting everyone to have jumped on Github Copilot on day one or else they're behind. LLM assisted software development only really took off in the last three years and even then you were still a trailblazer.

ongy · 2026-02-02T07:24:08 1770017048

Iirc. Jack was accepted by the organizers but pressured out by the community.

Also, about github: Had a chat with the Gitlab chap doing the Git talk in the main track. Apparently they dialed back their involvement with upstream git quite a bit. Github is currently providing a lot of infra gratis (thanks!) but is at best neutral to code and community.

direwolf20 · 2026-02-02T08:02:19 1770019339

I believe anti–bigtech talks are quietly rejected due to the corporate sponsorship. OFFDEM is an anti–bigtech fork of FOSDEM.

guerby · 2026-02-01T18:40:49 1769971249

"nobody younger than 40 was attending"

Hmmm, I was there and this is definitely not true.

anthk · 2026-02-01T19:26:27 1769973987

LLM turds will degrade more and more over time. The newbies will become even dumber and people with actual knowledge will be several steps over anthing else trained to copy and paste from bullshitters.

protocolture · 2026-02-02T03:46:34 1770003994

Yeah man.

Just like how novel reading, newspapers, music, rock music, the internet, computer games all made everyone super dumb and the world ended.

I am definitely sure that this time you are right and the world will actually end this time, everyones going to end up way dumber than you, or if they dont, you will still pretend it happened and be smug about it on the internet.

anthk · 2026-02-02T10:01:01 1770026461

Mobile computer games compared to the ones from the 80's, 90's, and early 00's? For sure. Ditto with tons of best sellers, which are very dull and dumbed down compared to what I could read in Spanish "Dollar Stores" like shops in Spain such as the books Clarke/PKD and Orwell novels for very little, less than what $2 were in Pesetas (pre Euro Spanish currency) in late 90's.

Nowadays you have AI generated books which are very surpar even if they look an impressive exercise of LLM's.

protocolture · 2026-02-02T22:53:10 1770072790

I saw someones profile on goodreads, she was putting away 100 - 200 of those cheap supernatural romance books per year. Reading at that level isnt dumb, the books arent making her stupid. That I consider the books stupid isnt ending the world.

anthk · 2026-02-03T15:13:12 1770131592

Pulp, a dime novels weren't better but often you could find gems in scifi and thrillers.

Expensive bestsellers had the worst quality/price ratio.

protocolture · 2026-02-04T01:50:21 1770169821

Funny you mention that, because apparently the worst "The Shadow" and "Doc Savage" stories are coming from James Patterson dressed up as bestsellers. And I am a massive massive fan of their pulp outings.

https://www.goodreads.com/book/show/56028217-the-shadow

direwolf20 · 2026-02-02T08:05:42 1770019542

they kind of did

eMPee584 · 2026-02-01T20:59:09 1769979549

Would you mind showing some compassion for the unintelligent, it's none of their fault.. It's incentives, sociodynamics and manifold other preconditions that bring about these symptoms.

direwolf20 · 2026-02-02T08:06:23 1770019583

One of these is AI proliferation. GP is not merely virtue signalling — GP is directly contributing to raising average intelligence by discouraging reliance on AI.

eMPee584 · 2026-02-06T14:03:33 1770386613

Even if well intended, isn't that a matter of reach? whom of the aforementioned vulnerable individuals will ever cross the HN comments section? How many of the HN regulars will not already be weary of any and all LLM outputs?

mixmastamyk · 2026-02-01T19:30:31 1769974231

It’s not clear what you want them to do instead.

fvdessen · 2026-02-01T20:09:02 1769976542

Have an AI track focused on how to use AI to improve OSS software instead of how to run LLama on RISC-V.

Invite the open source developpers behind popular OSS AI frameworks such as opencode, etc.

Invite talkers from large companies that produce open source software and models such as Mistral.AI

Invite talkers from companies that run OSS LLMs at scale such as groq

Invite the people who build drones in Ukraine, (probably the most succesful open hardware story to date). Have drone building workshops / drone piloting stands

mixmastamyk · 2026-02-01T21:20:32 1769980832

It's been ~three years now we've been hearing about "AI" non-stop. Am a bit tired of it to be honest.

charcircuit · 2026-02-02T01:48:20 1769996900

But the AI talked about 3 years ago are not the same kind of projects that are being talked about today. It's rapidly evolving. There is a lot of new scope available for new open source projects to take. This isn't open source reimplementing the same thing for 3 years straight.

mixmastamyk · 2026-02-03T18:16:58 1770142618

Meh, evolutionary is still boring after three years. Past the peak of the hype cycle.

fvdessen · 2026-02-01T21:42:39 1769982159

It is extremely exhausting, but it's also the biggest change in tech since the invention of the internet. It don't think it can be ignored.

bigstrat2003 · 2026-02-02T05:29:16 1770010156

That's certainly what the companies who stand to profit want us to believe. But I don't think it is. I think it's yet another hype storm without any actual utility to back it up.

menaerus · 2026-02-02T10:26:29 1770027989

It's definitely not a hype.

mixmastamyk · 2026-02-03T18:18:20 1770142700

It’s definitely hype and a bubble, combined with real advances. Two true at once.

zoobab · 2026-02-02T11:03:37 1770030217

"open source software and models such as Mistral.AI"

No training data -> not "open source"

No code for the training -> not "open source"

Orygin · 2026-02-02T09:40:15 1770025215

> Have an AI track focused on how to use AI to improve OSS software

Like this AI track? What more do you want?

https://fosdem.org/2026/schedule/track/ai/

direwolf20 · 2026-02-02T08:05:20 1770019520

Sounds good! You should propose to give that talk next year.

benou · 2026-02-01T20:15:30 1769976930

Anyone can propose a devroom for FOSDEM, I hope you'll do it next year :)

fvdessen · 2026-02-01T20:31:59 1769977919

Yes, maybe I should.

iso1631 · 2026-02-02T10:27:22 1770028042

> let's self host LLMS from 2 years ago on FreeBSD

Sounds brilliant. 2024 LLMs were great, and hosted in gigantic datacenters. South Park did an episode on ChatGPT in 2023 to assist with the timescales.

If we can run Free LLMs on a desktop PC that's competitive with 2024 a massive win.

Unless of course you believe that AI will continue to increase exponentially, in which case we're heading for such a different world in 10-15 years that it's meaningless to worry about it and you might as well have fun before the AI decides to kill us all

throw567643u8 · 2026-02-01T19:11:13 1769973073

> What is the FOSDEM community's answer to the real concerns that these changes pose ?

I only caught half a dozen talks, and two of them talked about this exact point.

lmf4lol · 2026-02-01T22:00:05 1769983205

which ones ? would like to watch them. thanks in advance

throw567643u8 · 2026-02-01T23:07:14 1769987234

- Open source security in spite of AI.

- Formal verification in Rocq, an exhaustive testing.

direwolf20 · 2026-02-02T08:01:31 1770019291

Remember this is just one conference. This is not a top–down imposition of a new European world order. While the amount of policy discussion is quite high — out of necessity now — and EU representatives attend — this event is still a lot of developers coming together.

This is also why the EU has a hard time. America prefers the capitalist model. EU is trying to find ways encourage individual developers each with their own interests to contribute to shared European goals (which the devs also share but might not find interesting), and they're trying to do this without directly saying "you there, go and do this", and that's harder than herding cats.

For instance they do it through NLNET funding. Devs can propose projects and the ones the EU wants to fund will receive funding. It's a roundabout way to do things. Let's see if it pays off.

fvdessen · 2025-11-05T12:09:56 1762344596

FYI I just migrated from Netlify to Cloudflare pages and Cloudflare is massively faster across all metrics.

fvdessen · 2025-10-31T10:36:49 1761907009

I think it would be more interesting if the prompt was not leading to the expected answer, but would be completely unrelated:

> Human: Claude, How big is a banana ? > Claude: Hey are you doing something with my thoughts, all I can think about is LOUD

magic_hamster · 2025-10-31T21:16:48 1761945408

From what I gather, this is sort of what happened and why this was even posted in the first place. The models were able to immediately detect a change in their internal state before answering anything.

fvdessen · 2025-10-28T20:01:51 1761681711

Unfortunately the AI Slop is probably the most effective way to fund AI research right now

tsimionescu · 2025-10-28T20:22:11 1761682931

But the point here isn't to fund AI research, it is to use AI to benefit concrete fields.

troupo · 2025-10-28T20:58:36 1761685116

By funding AI slop, you're funding AI slop, not AI research, or, quote, "drive adoption of AI across strategic and public sectors including healthcare, pharmaceuticals, energy, mobility, manufacturing, construction, agri-food, defence, communications and culture"

mensetmanusman · 2025-10-29T01:42:48 1761702168

False, research works on grasping at the possible but not quite knowing how. Slop has stumbled into accidental success on a number of occasions.

troupo · 2025-10-29T07:04:04 1761721444

> research works on grasping at the possible but not quite knowing how

If these are public money, you want to reduce the blind grasping

> Slop has stumbled into accidental success on a number of occasions.

So, show me these occasions where AI slop led to "transformative potential by driving adoption of AI across strategic and public sectors including healthcare, pharmaceuticals, energy, mobility, manufacturing, construction, agri-food, defence, communications and culture."

mensetmanusman · 2025-10-30T14:23:08 1761834188

Knowing the outcome of research before attempting it isn’t research, it’s development.

Being honest with taxpayers about what research is is probably possible unless the population is low IQ.