Hacker Newsnew | past | comments | ask | show | jobs | submit | gbibas's commentslogin

I've been thinking about something like this. It would be really helpful to add another axis here which is the comparability in output across models for different tasks. So like if you are doing some type of classification or typing activity, can you get 95% of the performance of Sonnet with Haiku, or 89% of the performance of Sonnet with Gemma4. Then the cost/capability matrix space becomes more rich because you can decompose tasks and assign out according to cost and capability


I'd love that! Is that comparison data between models available anywhere?


This is cool. I am working on something similar for code schema reads by AI, which cost me a lot of tokens. I’ll share once battle tested. The idea of abstracting and then giving it a tree to follow is where I landed also.


Let me know your findings.


Cool. Thank you for sharing. While AI tools are extremely powerful, packages like this help create some good standards and stepping stones for connectivity that the models haven’t gotten around to yet. Thanks again.


Ofc! Please try it out. Stop by in the Discord or Github Issues if you have any questions!


Mythos and AI infused make some sense, but the thing I keep wondering is that while attacks can be planned and executed by AI, because they inherently we have not yet solved the hallucination problem, any though that AI will help you defend against attacks completely is short sighted. Mythos can find things, but if you ask it if you are secure, can you trust it? It is asymmetric AI warfare because of hallucinations.


You hit on something that AI can be really good at, which is shining light on corporate activities. Salary and movement are great, and interesting, but this could also help parse things like entry and exits into business markets where companies often quietly add or remove things from their filings. Keep going.


you are absolutely right - filings have a ton of signals beyond just the exec changes. Market entries, risk factor changes, material agreements. Lot of room to expand.


This makes sense, but it also causes concern. With AI whether it is content or programming, losing the new novel approaches which may wind up being better in the long run, get shut down for expediency in the short run. This is nothing new and not AI specific behavior, large comoanies have been doing this forever, but it leads to a death of innovation and a spiral inward of self reinforcing loops. You are absolutely right that llms won’t know it and will need to learn something like this all over, but they are good at that and if we stop to find better patterns (which is what humans are great at doing) we keep creativity alive and find meaning while making our work more productive in the long term.


That's a different conversation than the one I'm having. I haven't made any argument against making new things.

I'm saying "Django, but different" isn't for agents. It makes agents work harder, in general.

Make anything you want. Just don't lie to yourself and others about who it's for.


I live here in CA. If it is something that gets attention at all, whether AI or 3d printing or anything else, politicians here feel it is their duty to regulate it. If it should be regulated like politicians spending our money or insider trading, they want nothing to do with it. Less power for us, more power and money for them.


I connected on my iphone. Didn’t connect it to any tools initially and used my throwaway email, as I need to be more comfortable before connecting anything real. Clean interface. I really like the sms connection and could see how that would appeal to less AI tech savvy users. You have done some good planning and UX/Ui. Thanks for sharing


Cheers!


Very Meta and very cool. Well written


Exactly right. Unfortunately, this is likely a reporter who is just looking for something that will get attention. I remember a time when reporters wrote things based on importance, not chasing clickbait like everyone on social media. Whoever Satoshi is/was, they wanted privacy. Let them have it and move on.


You do realize the author is the one that exposed the Theranos fraud ?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: