> We had a budget alert (€80) and a cost anomaly alert, both of which triggered with a delay of a few hours. By the time we reacted, costs were already around €28,000.
I had a similar experience with GCP where I set a budget of $100 and was only emailed 5 hours after exceeding the budget by which time I was well over it.
It's mind boggling that features like this aren't prioritized. Sure it would probably make Google less money short term, but surely that's more preferable to providing devs with such a poor experience that they'd never recommend your platform to anyone else again.
The main function of OpenClaw was for people to signal how advanced and cutting edge and thought-leader-y they were. All those Mac minis are sitting idle now.
I’m a professional maintenance gardener and I have used NanoClaw running on my Mac to do the following:
Schedule jobs on my job management tool (I vibed it) using a custom MCP. I’ll ask it questions like “what jobs are on today” etc. start the job, complete the job etc.
It will watch Gmail using a MCP for work orders from local real estate agents, where it will schedule quote visits.
After the quote visit I add the photos to telegram (the channel I happen to use) where I then ask it to analyse the photos.
Claude in NanoClaw does a good job of figuring out what needs to be done, but it doesn’t always get it quite right, so I use intake-api which is a “session inbox” [1] that generates a form and uploads it to Cloudflare along with the images and puts a link to the form in the chat so I can make adjustments to the annotated images of the property. After I’m happy, I click submit.
I’ll then go back to the telegram channel and let it know I have submitted the form. Claude will then pull the JSON payload back down into the session and integrate it into the quote.
It will ask me questions about what the job will cost etc, and anything else it thinks it needs to know.
It will then generate a full PDF proposal using Latex between 14 and 32 pages long depending on how many photos were taken.
There are sections with terms and conditions as well as a bit of sales guff etc. as well as quite a nice cover page with their contact details and mine etc.
When I’m happy I’ll ask it to create a draft email in Gmail with the attached pdf proposal.
I’ll review it quickly and then send it.
I also have the Xero MCP setup so I can ask it to create invoices and contacts etc.
I do all of this when getting in and out of my truck.
It’s freed up my home life to spend more time with my children and my Mrs.
There is alot of scope for small business owners who need these sorts of agentic assistant tasks and with my Agentic CRM I’m see a glimpse of the future for guys like me I think.
I too don't feel like exercising after work, or at 6 in the morning. And many years ago I determined that without exercise my life is a living hell. Simply, I need to push weights and exert myself physically to keep mentally sane.
So I've found work that works for me, as in, I can go to the gym at lunch, which is when I feel good about working out.
Make your routine work for you, and not the other way around. Prioritising yourself is exactly like the plane safety announcement, place the mask on your face before assisting others, because you're no good to them passed out. Same thing with your health, make it a priority to look after yourself and feel good, or you won't be your be able to help anyone, and you won't be a good version of yourself people will want to hang with.
The policy of applying US immigration enforcement actions against legal visa holders who have attended specific legal (US based) protests has been publicly reported and confirmed by many government officials and is unrelated to anyone trying to enter the country.
Senior ICE officials have testified under oath in federal court that analysts were moved from counterterrorism, global trade, and cybercrime work to this group focused specifically on writing reports about people involved in student protests.
There's still the question of access to the codebase. By all accounts, the best LLM cyber scanning approaches are really primitive - it's just a bash script that goes through every single file in the codebase and, for each one and runs a "find the vulns here" prompt. The attacker usually has even less access than this - in the beginning, they have network tools, an undocumented API, and maybe some binaries.
You can do a lot better efficiency-wise if you control the source end-to-end though - you already group logically related changes into PRs, so you can save on scanning by asking the LLM to only look over the files you've changed. If you're touching security-relevant code, you can ask it for more per-file effort than the attacker might put into their own scanning. You can even do the big bulk scans an attacker might on a fixed schedule - each attacker has to run their own scan while you only need to run your one scan to find everything they would have. There's a massive cost asymmetry between the "hardening" phase for the defender and the "discovering exploits" phase for the attacker.
Exploitability also isn't binary: even if the attacker is better-resourced than you, they need to find a whole chain of exploits in your system, while you only need to break the weakest link in that chain.
If you boil security down to just a contest of who can burn more tokens, defenders get efficiency advantages only the best-resourced attackers can overcome. On net, public access to mythos-tier models will make software more secure.
Attorney admitted in NY here. It's fascinating that Judge Rakoff likely would have come to the opposite conclusion if the Claude chat was at the attorney's request or suggestion. I am surprised the court placed so much reliance on the Terms of Service, which are probably not so different than those of Outlook, Gmail, etc., say, yet nobody disputes that attorney-client emails remain privileged notwithstanding the Terms of Service of those providers. At least I have never seen anyone argue in NY that privilege is waived by emailing. And unlike sending an email to another person, chatting with Claude is a solo conversation more like organizing one's notes, which if in contemplation of obtaining legal advice seems privileged to me. I think this is a very close question and am not sure it would come out the same way in other courts or on even slightly different facts. Very interesting legal question.
I fear that outside of cataclysmic global warfare or some sort of butlerian jihad (which amounts to the same) this genie is not going back into the bottle.
This tech is 100% aligned with the goals of the 0.001% that own and control it, and almost all of the negatives cited by Kyle and likeminded (such as myself) are in fact positives for them in context of massive population reduction to eliminate "useless eaters" and technological societal control over the "NPCs" of the world that remain since they will likely be programmed by their peered AI that will do the thinking for them.
So what to do entirely depends on whether you feel we are responsible to the future generations or not. If the answer is no, then what to do is scoped to the personal concerns. If yes, we need a revolution and it needs to be global.
*Donald Knute -> Donald Ervin Knuth is the author of the book "The Art of Computer Programming" (in progress for a couple of decades, currently volume 4c is being written). It is quite advanced, and it will likely not cover compilers anymore (Addison-Wesley had commissioned a compiler book from Knuth when he was a doctoral candidate, now he is retired and has stated his goal for the series has changed).
I disagree with the author's point: the "Dragon book"'s ("Compilers: Principles, Techniques, and Tools" by Aho et al.) Chapter 2 is a self-sufficient introduction into compilers from end to end, and it can be read on its own, ignoring the rest of the excellent book.
Another fantastic intro to compiler writing is the short little book "Compilers" by Niklaus Wirth, which explains and contains the surprisingly short source code of a complete compiler (the whole book is highly understandable - pristine clarity, really) and all in <100 pages total (99).
(I learned enough from these two sources to write a compiler in high school.)
The older I get the more sensitive to a single poor night's sleep I become.
The most frustrating effect is that even a few drinks in the evening (maybe over 2-3 units). Unsettles my sleep that if I'm in the process of learning something feels like it sets me back several days.
That's not even counting the slowed processing I feel, and lower productivity the next day.
I genuinely have to revisit old information.
A genuine hangover from a heavy night can put me out of action for half a week!
When I was younger I'm not sure I had many good nights sleep let alone noticed a bad one!
I've heard that small amounts of alcohol can actually improve learning interestingly by preventing interference from events later in the day.
I feel bad that people have to read this. It's complete puffery, made up for clicks, and the biggest thing is the pure bravado with which a company says, "Hey, let's just waste a ton of money, all for a potential blog and marketing piece." This is not really automated in any fashion. I was dubious at first, but then I saw the screencaps showing the devs interacting with Luna via a Slack workflow with a human in the loop — meaning they're literally just proxying their own behavior through an LLM. This is no different than anyone who consults AI for any decision with context. To get even more technical on the fallacy: this is not automation, as there is data leakage at every step where there is a human in the loop. A broken clock is right twice a day; an LLM could cycle through 100 guesses to pick a number, but don't market that as an oracle. Aside from that, you could just look at the pictures and context (retail in SF) and assume making a profit here would be near impossible. An actual AI ceo would probably have immediately cancel the lease.
> Immigration authorities say the move is aimed at preventing cases in which foreign workers obtain visas under one category, but then engage in unrelated or lower-skilled work.
The claim appears to be that people were using up visa slots for things like interpreters or other jobs where clearly you'd need good language skills to actually do the job, including in Japanese, with the intent all along of doing some other job instead. An up-front test should let through almost all of the legitimate claimants of these visas, and stop almost all the fraudsters. Probably a lot cheaper than a similarly-effective level of after-the-fact auditing, or more-extensive checks into applicants' work situation.
[EDIT] I mean, in the framing provided by the government, the above appears to be what's going on. Governments may lie, of course.
I'm running it for the first time and this is what the thinking looks like. Opus seems highly concerned about whether or not I'm asking it to develop malware.
> This is _, not malware. Continuing the brainstorming process.
> Not malware — standard _ code. Continuing exploration.
> Not malware. Let me check front-end components for _.
Paper Computing (great name!) is something I've been thinking about a lot to help my kids benefit from tech without exposing them to the brain melting addiction of screens. I sacrificed a few crazy nights of sleep to try to build a Paper Computer Agent prototype for a recent Gemini hackathon (only to disappointingly have submission issues right before the actual deadline) which my kids loved and keep asking me to set up permanently for them.
It's essentially a poor man's hacked up DynamicLand - projector, camera, live agent. There are so many things you could do if you had a strong working baseline for this. My kids used it to create stories, learn how to draw various things, and watching safe videos they could hold in their hand.
There's something weirdly compelling and delightfully physical about holding a piece of paper that shows a live rocket launch, with the flames streaming down the page. It could also project targeted pieces of text, such as inline homework advice, or graphs next to data. It doesn't take long to imagine any other number of fun use cases, and it feels a lot more freeing and inspiring than keeping everything bound to a screen.
The people. We voted for the people who gave the power, and we re-elected them. It’s really that simple. Is it “too late” now? maybe, but we had ~25 years since this all started post 911 to react, and chose not to.
I am personally of the opinion that ML will end up being 'normal technology', albeit incredibly transformative.
I think you can combine 'Incanters' and 'Process Engineers' into one - 'Users'. Jobs that encompass a role that requires accountability will be directing, providing context, and verifying the output of agents, almost like how millions of workers know basic computer skills and Microsoft Office.
In my opinion, how at-risk a job is in the LLM era comes down to:
1: How easy is it to construct RL loops to hillclimb on performance?
2: How easy is it to construct a LLM harness to perform the tasks?
3: How much of the job is a structured set of tasks vs. taking accountability? What's the consequence of a mistake? How much of it comes down to human relationships?
Hence why I've been quite bullish on software engineering (but not coding). You can easy set up 1) and 2) on contrived or sandboxed coding tasks but then 3) expands and dominates the rest of the role.
On Model Trainers -- I'm not so convinced that RLHF puts the professional experts out of work, for a few reasons. Firstly, nearly all human data companies produce data that is somewhat contrived, by definition of having people grade outputs on a contracting platform; plus there's a seemingly unlimited bound on how much data we can harvest in the world. Secondly, as I mentioned before, the bottleneck is both accountability and the ability for the model to find fresh context without error.
There seems a fair enthusiasm in the UI of these to hide code from coders. Like the prompt interaction is the true source and the actual code is some sort of annoying intermediate runtime inconvenience to cover up. I get that productivity can be improved with a lot of this for non developers, just not sure using 'code' as the term is the right one or not.
p.s.
Normally we downweight subsequent articles in a series because avoiding repetition of any kind is the main thing that keeps HN interesting. But we made an exception in this case. Please don't draw conclusions from that since we'll probably get less series-ey, not more, after this! Better to bundle into one longer article.
As someone who works on closed source software and has done for a couple of decades, most companies won't even know about that and of those who do only a fraction give enough of a shit about it to do anything until they are caught with their pants down.
Funny because many people here were so confident that OpenAI is going to collapse because of how much compute they pre-ordered.
But now it seems like it's a major strategic advantage. They're 2x'ing usage limits on Codex plans to steal CC customers and it seems to be working. I'm seeing a lot of goodwill for Codex and a ton of bad PR for CC.
It seems like 90% of Claude's recent problems are strictly lack of compute related.
Upvoting because Shorts are terrible. Flagging because the submission title - which is 100% faithful to the friendly article - is a complete lie!
Plug: I added a bunch of features to Control Panel for YouTube [1] which let you either hide Shorts completely, everywhere (which is the default) or take more control of how you use them if you do (e.g. redirecting to the normal video player)
Technically incorrect, Supreme Court precedent has held that aliens are entitled to lesser First Amendment protections while seeking to enter the United States. You could be on US Soil (i.e. entering customs at an airport) and those protections don't apply.
The person in question said he was in Geneva when he received the email from Google. Therefore is a non-US citizen residing outside the country entitled to 1A protections for something they said or did while in the US? I'm not expressing an opinion but I wouldn't take that statement as legal advice.
> If I hand wrote some notes in a notebook or diary, I wouldn't have to hand them over, as I understand it, even with no lawyer in the mix. Same if I wrote some notes in a text file on my computer.
Absolutely wrong in the U.S. The police can't just break into your home and demand it, but a judge can 100% mandate discovery or a subpoena if there is reason to believe that evidence exists which is relevant to the case.
The 4th amendment prohibits UNREASONABLE search and seizure, and we let judges make that determination. You never have absolute privacy rights.
These types of huge perfect specimens always take my breath away when I am able to see them in person. To think that this kind of stuff just kinda exists buried in the earth...
I am a part of a local mineral club which hosts several "field trips" a year to various mineralogically interesting locations (most of which aren't accessible as an individual, like private land and special digs at active mining/quarrying sites on their days off). I have never found anything even remotely as beautiful as the specimens shown, but the small collection of mildly interesting things that I've smashed out of the earth with my own 2 hands is amazingly satisfying to me. You don't even have to be a super dedicated "rock nerd" to take part, I highly recommend looking for local mineral clubs to join if this even remotely interests you. It's really a ton of fun!
According to a recent CRYPTO-GRAM issue from Schneier, it's in Meta's interest to push these regulations as their product isn't an OS. Their competition (Apple/MS/Google) are OSs though.
I had a similar experience with GCP where I set a budget of $100 and was only emailed 5 hours after exceeding the budget by which time I was well over it.
It's mind boggling that features like this aren't prioritized. Sure it would probably make Google less money short term, but surely that's more preferable to providing devs with such a poor experience that they'd never recommend your platform to anyone else again.