Hacker Newsnew | past | comments | ask | show | jobs | submit | sbuttgereit's commentslogin

Sort of... this was version 3 of the engine, a fairly big redesign and for version 3 this was the first flight.

I think that's part of it, but not necessarily the whole story. I haven't criticized them in the thread yet... so here goes.

Previously, I posted critically not because they were running businesses without humans, but because their post just described going through the motions without actually discussing if it really was effective or not. Sure the AI got through the day, checked off tasks on the list, but did it actually do that effectively or efficiently in any important way? Who knows... wasn't discussed.

I think where I come down now is that repeats of this same gimmick feel like just that: they're just playing a gimmick for attention. I can't tell that they're really demonstrating any special or significant capability... but man, just the story of trying to run a business without humans will get you that sweet, sweet attention.

Unfortunately, looking at least the first post, I stopped reading their "we let AI run X" posts. I think the only thing I really came away with is how thoughtless and mundane are most aspects of running a small business actually is; something I knew, but it really drove the point home. I didn't learn anything unexpected about AI tools or their products that seemed compelling or unexpected.


Easily one of the best values in commercial software if you have a need for what it does. I think I paid something ~$70 a couple of years ago. While there's a limitation on the number of updates you get based on release version, I'm still getting updates under the license a couple years on. All that and you get a genuinely professional level tool for much less than what similar software from competitors offer.

I couldn't more highly recommend it.


I'm no musician but I paid for a Reaper license just because the software is so good and useful and the licensing scheme is so reasonable. Like, it's kind of hard to beat that.

> I thought this is technically impossible

No, very technically possible... though, with good randomness, very, very unlikely.

But nothing technically prevents a UUIDv4 from generating a duplicate value.


There are a number of assumptions in what you say that don't necessarily hold.

1) That school is simply about landing a job.

2) That there is a value in students knowing how to have the AI do problems for them.

3) That follow-on effects of manually solving difficult problems is discountable compared to the direct output of the work.

I would say you're absolutely correct in that people pay for the result and they don't really care how you got there. But that's a pretty shallow rationale which overvalues the ability to be the conduit from the source of requirements to the final output and undervalues the individual ability to think for one's self when faced with the challenges of technological, geopolitical, or simply uncontrolled personal circumstances.

"The conduit", who you seem to be believe is the one with marketplace advantage, is exactly the person I would say is the most vulnerable. Not because getting the AI to produce demands is without value, but that its quickly becoming a task that doesn't need the intermediary at all. Those magicians that can prompt/agent/mcp/etc their way through to positive successes are actively being challenged by the very AI producers which our conduits people now depend on. Removing the need for intermediaries would be a great competitive advantage for any AI vendor able to achieve it. But insofar as intermediaries create output from LLMs, they'll not be very well differentiated: the common wisdom tends to be the output, lest the AI be accused of hallucination or being overly supportive. But when everyone is using AI for everything the opportunities will be in arbitraging that which is missed by common wisdom... filling in the cracks that any responsible AI would simply never venture to consider. Our conduit-person will be at a decided disadvantage because it takes real thought to know when it's best to color within the lines, and when it's best to not do so.

And that's really it. A good education is teaching you about the process of thought and becoming practiced at thinking. I would expect a better educated, thinking person to more easily adapt and make use of technology such as generative AI to solve problems more so than a person that just knows how to deal with today's prompting needs. The thinking person will be able to understand the bigger picture to better get a consistent and high quality series of results than the person just getting results as needed.

And that's really it. The output of a good education is you as a thoughtful & knowledgeable person: the output on the page is merely a means to that end. But if you focus solely on the answer on the page and the only important thing... you're really evaluating the AI, not the person that acted as intermediary.

In otherwords, if the person following your advice comes for a job, simply ask them which AIs they used in the interview and then just sign contracts with those vendors instead... you'll get better bang for your buck cutting out the middleman.


Looking at the discussion below this comment, I'd just add this video by AlphaPheonix:

https://www.youtube.com/watch?v=2Vrhk5OjBP8

Good discussion in the comments there as well.


I skimmed through this, and maybe I missed it... but what really are they trying to prove? Are they trying to show that AI is capable of arbitraging consumer desires vs. market products/services into a successful business? Are they trying to show that once you get to financially managing a business that the ruthlessly efficient demands of the AI can mean points to your margins? Or are they simply trying to get attention in an otherwise arguably overcrowded market for AI service s (maybe the AI suggested something like this)?

The only thing that I saw demonstrated, and again, I skimmed, is what many thousands of software developers using AI tools to write their boilerplate already know: these tools, as of now, are great at going through the motions. A successful retail business, and I spent many years in the retail industry, isn't about putting together a nice store front, hiring clerks, and selecting just any-old-products: it's about being profitable. In traditional retail one of most important things is getting the right real estate for your target market... seems like that choice was made already in this case. Yes, a nice store front and good clerks are important, but I've worked in chains which were immaculately designed and built stores with great clerks that failed... and some that opened little more than fluorescent lighted hellscapes with clerks that barely cared that succeeded. In both cases the overall quality of the decisions and strategies relative to the target markets mattered to the success of the business. Just going through the motions didn't.

So if all is this is to say AI can do the things people generally do in these circumstances then sure, you didn't need this much human effort to prove that.... developer types do that at scale everyday now. If there was something different that this company is trying to learn, I'd be much more interested in that.


If I'm being charitable, it's more about the ability to orchestrate and resolve tradeoffs across these different tasks / domains? The overall C&C, presumably. Which is still not so surprising.

Really it's an excuse for the company to test all the harnesses and tools they have built to make it work.


i agree that some of these things we could have already guessed-- like yes agents can research stuff and order stuff off the internet. I think what will be a lot more interesting is the interactions that happen between Luna the agent running things and the employees it hired. I guess less about AI being able to do the procurement CEO level stuff, and more how it does the HR level aspects of store management. That seems more important in the log run, because like you said, we already know capabilities are there. I think what Andon Labs is doing is more about the safety aspect now. Seems that way at least with how transparent they are about Luna losing money and messing up lol


They're trying to get noticed so that a wealthy cult member's brain gets tickled to the tune of 9 figures


I think this really needs to be party of the message. It's great that Claude found a vulnerability that apparently has been overlooked for a long time. It's even proper for Anthropic to tout the find. But we should all ask about the signal to nose ratio that would have been part of the process. If it only was successful... That would be worth touting, too. But I expect there was more noise than they'd care to admit.

Or put another way, the context matters.


I have to agree with you. We don’t talk nearly enough about the real signal to nose ratio.

(Sorry. I couldn’t resist lol)


They've pretty clearly demonstrated the ability to get to orbit but have, quite reasonably, not actually put the thing into orbit. Given the size of the rocket they've been needing to demonstrate things like the relight for control after achieving orbit and have prioritized other issues like figuring out reentry.

So yes, you are literally correct in that they haven't put one in orbit, but it's more out of caution than capability. What they've only demonstrated in the most recent tests is that they have good reason to believe to believe that they can deorbit in a controlled fashion. But... now they've upgraded everything: raptor 3, booster v3, starship v3. Those need to prove out those capabilities again.

So I wouldn't be surprised if they continue the suborbital program for the next 3 or 4 tests. Given all the redesign, they aren't exactly at the beginning, but they have to show that they haven't broken what they previously fixed.


As others have mentioned, that's simply not going to tell you anything. AIS can and is often times turned off in such situations and it ships can spoof their location by sending false AIS... something that situations like could encourage, at least one could well imagine.

I find Sal Mercogliano's "What's Going on With Shipping?" to be a better source to understanding what's happening in the Strait. Here's a link to yesterday's episode "Strait of Hormuz 3-Week Recap | What is the Status of the Ships, Transits and Escort Mission?": https://www.youtube.com/watch?v=q64cOs7GN_4


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: