I've been playing with Brad Ross's AISP [1] to get better-quality LLM outputs at strategic stages of our basic design / plan / implementation workflows.
A concrete example of this is our Adviser Skill experiment [2]. In most AI workflows, a "reviewer" agent just dumps markdown feedback. Our Adviser doesn't just "talk"; it outputs an AISP 5.1 document (a kind of "Assembly Language for AI Cognition").
This document forces the agent to define:
- Strict Type Definitions for the issues identified (e.g., distinguishing between a gap, an edge case, or a missing requirement).
- EARS Rules (Easy Approach to Requirements Syntax) that determine the verdict. For example, a rule might state: "If any issue has a severity of ⊘ (critical), then the workflow MUST halt."
- Formal Evidence: Every "approve" or "reject" verdict must include a confidence score (δ) and a grounding proof (π) that explains why the change matches the original specification.
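To make the halt rule above concrete, here's a minimal sketch of what an EARS-style severity gate could look like. All names and types here are made up for illustration; this is not the actual AISP 5.1 schema:

```python
from dataclasses import dataclass
from enum import Enum

class Severity(Enum):
    INFO = "i"
    WARNING = "!"
    CRITICAL = "⊘"  # the severity that must halt the workflow

@dataclass
class Issue:
    kind: str          # e.g. "gap", "edge_case", "missing_requirement"
    severity: Severity
    description: str

def must_halt(issues: list[Issue]) -> bool:
    """EARS-style rule: if any issue has severity ⊘ (critical), halt."""
    return any(i.severity is Severity.CRITICAL for i in issues)
```

The point is that the verdict is computed from typed data, not inferred from free-form prose.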
By treating the agent's output as a proof-carrying protocol rather than just text, we can chain multiple specialized agents (Architect, Strategist, Auditor) who "triangulate" on the codebase. They must reach a formal consensus where the variance between their scores is low.
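The "low variance" consensus gate can be sketched in a few lines. The threshold below is a hypothetical stand-in, not AISP's actual formula:

```python
from statistics import pvariance

def consensus_reached(confidences: list[float], max_variance: float = 0.01) -> bool:
    """Accept only when the agents' confidence scores (δ) agree closely.

    `confidences` holds one score per specialized agent
    (e.g. Architect, Strategist, Auditor)."""
    return pvariance(confidences) <= max_variance
```

So three agents at 0.92 / 0.94 / 0.93 would pass, while 0.95 / 0.60 / 0.90 would force another round.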
This shifts the agent's goal from "Finish the task at all costs" to "Prove that this change is safe and correct." It turns out that iterating on the verification logic is much more effective for building reliable systems than just increasing the number of agents running concurrently.
I've been actively exploring ways to achieve a seamless and natural conversation with an AI.
I played with detecting silences and punctuation (STT can detect, e.g., question marks), but this is clearly not enough for turn detection.
I think you made a huge step in that direction.
Did you write an article or blog post about how you trained your model?
I'd love to make this work with multiple languages, or things like rhetorical question detection.
I'm in the same situation. I found this cog project to dockerise ML: https://github.com/replicate/cog : you write just one Python class and a YAML file, and it takes care of the "CUDA hell" and dependencies. It even creates a Flask app in front of your model.
That helps keep your system clean, but someone with big $s, please rewrite PyTorch in Go, Rust, or even Node.js / TypeScript.
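For anyone curious, the shape of it is roughly this (a sketch from memory of Cog's docs; the model name, versions, and parameter below are illustrative, not from a real project):

```yaml
# cog.yaml
build:
  gpu: true
  python_version: "3.11"
  python_packages:
    - torch==2.1.0
predict: "predict.py:Predictor"
```

```python
# predict.py
from cog import BasePredictor, Input

class Predictor(BasePredictor):
    def setup(self):
        # load model weights once, at container start
        self.model = load_my_model()  # hypothetical helper

    def predict(self, prompt: str = Input(description="Input text")) -> str:
        return self.model(prompt)
```

Then `cog build` produces the Docker image and `cog predict` runs it, HTTP server included.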
Our teens' problem might be that they only value the number of likes on their profile.
Note that I won't get any credit for what I'm saying: I have fewer than 10 followers.
I'm just wondering if there's a one-size-fits-all solution for authz. I spent a few days on a use case:
- users have one or several roles (these are hierarchical)
- there are some objects in the system (hierarchical too, e.g. files and folders)
- there are different features available according to a user's subscription.
I ended up with a 30-line program which, given a set of rules, calculates who can access what in less than a millisecond. Is it worth an over-engineered mega-system?
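To give an idea of the scale, here's a sketch of the kind of tiny rule engine I mean. All role names, plans, and rules are invented examples (and I've left out the object hierarchy for brevity):

```python
# Role hierarchy: each role implies itself and everything below it.
ROLE_IMPLIES = {
    "admin":  {"admin", "editor", "viewer"},
    "editor": {"editor", "viewer"},
    "viewer": {"viewer"},
}

# Features unlocked by each subscription tier.
PLAN_FEATURES = {
    "free":       {"read"},
    "pro":        {"read", "write"},
    "enterprise": {"read", "write", "share"},
}

# Rules: (object_type, action) -> minimal role required.
RULES = {
    ("folder", "read"):  "viewer",
    ("folder", "write"): "editor",
    ("file",   "read"):  "viewer",
    ("file",   "write"): "editor",
    ("file",   "share"): "admin",
}

def can(user_roles: set[str], plan: str, obj_type: str, action: str) -> bool:
    """True if some role grants the action AND the plan includes the feature."""
    required = RULES.get((obj_type, action))
    if required is None or action not in PLAN_FEATURES[plan]:
        return False
    effective = set().union(*(ROLE_IMPLIES[r] for r in user_roles))
    return required in effective
```

It's all dictionary lookups and small set operations, which is why evaluation stays well under a millisecond.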
The problem isn't the 30 lines, though. The problem is "millions of users, billions/trillions of objects" and both are non-hierarchical with pairwise sharing etc.
If the requirements were simple, the POSIX model would still work too :)
I agree. For my use case, once a user is authenticated, you get their roles and subscription. There's a limited number of features or actions for each object type, and a limited number of object types. So you can ship the same set of rules to the client to manage the UI, and apply them again on the backend in the API.
In this use case the authz calculation time will be the same with a million users and a billion objects.
You are not wrong. And this pattern shows up everywhere, e.g. do you need a SaaS for "feature flags", since they're just an if statement?
In the case of authz, the argument for separating it as a concern is that many applications can share the same scheme, and you can have specialized tools for provisioning, auditing, etc.
Exactly. When you cross a certain complexity threshold, it's worth separating concerns. It's true for configuration, it's true for IaC, and also for authorization policy.
It'd be remiss of us to let left-pad aaS [0] go unmentioned in this thread... For those in today's 'lucky 10,000'^, you're welcome.
There are definitely good arguments for it (services like feature flagging, I mean), and such things are generally relatively low-cost; it's more the risk of adding a 'disappearable' dependency for anything and everything that would put me off.
(^And if you don't know about this, OMG how can you not have heard about lucky 10k?! Just kidding. [1])