From my perspective there are some people that have never built real processes in their life that enjoy having some processes now. But agent processes are less reliable slower and less maintenable then a process that is well-defined and architectured and uses llm’s only where no other solution is sufficient. Classification, drafting, summarizing.
I’ve had a Whatsapp assistant since 2023, jailbraked as easy assistant. Only thing I kept using is transcription.
https://github.com/askrella/whatsapp-chatgpt was released 3 years ago and many have extended it for more capabilities and arguably its more performant than Openclaw as it can run in all your chat windows. But there’s still no use case.
I like to experiment with AI flows to make iteration quicker, then once something work investing in is found, back up and build something that's actually repeatable.
Same thing could be said with SKILL.md yet they are highly useful...
Yes you can automate via scripting, but interacting with a process using natural language because every instance could be different and not solid enough to write a spec for, is really handy.
tl;dr: there's a place for "be liberal in what you receive and conservative in what you send", but only now have LLMs provided us with a viable way to make room for "be loosey goosey with your transput"
I understand but there still is usually 80-95% of the skill flow that you can script out that is repeated. Script it out and
simplify your skill, make it more stable, and provide more opportunity to scale it up or down i.e use stronger or weaker models if need be. We should be scripting and forming process first then seeing where we can put AI after that.
The AI for everything thinking is really easy to let infect you. I was trying to figure out how to make some SQL alerting easier to understand quickly. The first thing my brain went to was "oh just shove it into an LLM to pull out the info of what the query is doing". And it unfortunately wasn't until after I said that out loud that I realized that was a stupid idea when you could just run a SQL parser over the query and pull the table names out that way. Far faster, more cost effective, and reliable than asking an LLM to do it.
That’s actually an awesome idea and totally helps to reduce wasting context size - move repeatable instructions to a SKILL.md, and once they’re repeatable and no longer have variability to input, turn it into a tool! Rinse repeat.
Oh nice, you could even eventually turn the whole process including inference into an app so that you’ve cut out the LLM from the whole process saving you execution time
I find that it's usually management that ask for such things "because AI".
I mean using AI is a great way to interpret a query, determine if a helper script already exists to satisfy it, if not invoke a subagent to write a new script.
Problem with your "script" approach is how does that satisfy unknown/general queries? What if for one run you want to modify the behavior of the script?
Exploring with AI doesn’t lead to the same level of learning. They are doing the equivalent of paying to skip the level up of their character and going to the final boss with level 1 armor
I look at it more like speedrunning a level. You're skipping the parts of the level that take up the most time, some times using hacks. Is it universally as much fun as playing the game? No, just like using AI to prototype might get you to the same place, but without the experience of discovery and blockers along the way.
Fully agree with your comment regarding real processes. Being a Six Sigma Black Belt, studying processes and reducing the errors is critical.
The Openclaw processes at the moment scare me.
I’ve had a Whatsapp assistant since 2023, jailbraked as easy assistant. Only thing I kept using is transcription.
https://github.com/askrella/whatsapp-chatgpt was released 3 years ago and many have extended it for more capabilities and arguably its more performant than Openclaw as it can run in all your chat windows. But there’s still no use case.
It’s really classification and drafting.