“It’s a shame that @DarioAmodei is a liar and has a God-complex. He wants nothing more than to try to personally control the US Military and is ok putting our nation’s safety at risk.
The @DeptofWar will ALWAYS adhere to the law but not bend to whims of any one for-profit tech company.”
My strange observation is that Gemini 2.5 Pro is maybe the best model overall for many use cases, but only at the start of a chat. In other words, if it has all the context it needs and produces one output, it's excellent. The longer a chat goes, it gets worse very quickly. Which is strange because it has a much longer context window than other models. I have found a good way to use it is to drop the entire huge context of a whole project (200k-ish tokens) into the chat window and ask one well-formed question, then kill the chat.
> The longer a chat goes, it gets worse very quickly.
This has been the same for every single LLM I've used, ever. They're all terrible at that.
So terrible that I've stopped going beyond two messages in total. If it doesn't get it right on the first try, it's more and more unlikely to get it right with every message you add.
Better to always start fresh and iterate on the initial prompt instead.
Hey, this has been my experience, too! I like Gemini because I’ve told it the tone and style I like my answers in and the first answer is very, very on point with that. But several times I’ve noticed that if I ask follow-up questions, the style immediately changes for the worse, often no longer following my preferences. I’ve also noticed that in follow-ups it makes really bad analogies that are not suitable at all for the kind of audience that the first response is catered to. I’ve been clicking the thumbs-down button every time I’ve seen this and commenting on the change in style and quality, so hopefully the training process will ingest that at some point.
That was a great one, as was the Alien game. I'm sure there was shovelware crap too, but it seems (in the glorious golden hindsight of the past) like IP was treated better. Or maybe I just didn't care because I was young or because I owned Fast Hack'em and had a few friends with C64s too.
I love the Gemini models and think Google has done a great job on them, but no other model series I use seems to suffer more from context rot in long conversations. Which seems strange given the longer context.
I absolutely adored these books as a kid! Spent every dime of book fair money on them every year and used to beg my parents to take me to the library to check out others.
I love the framing of them in this article as the gateway drug to interactive entertainment.
In addition to Korea being one of our most important military allies in the world, you need batteries for military drones, and the US is way behind in the development of a domestic manufacturing supply chain for next-gen batteries.
So now we know clearly that nationalist xenophobia is the true most important priority for this administration. Or at least, more important than either the domestic economic interests of their own base or strategic national security interests.
> So now we know clearly that nationalist xenophobia is the true most important priority for this administration
Just now? The man entered politics and the first thing he said was how he was going to build a wall to keep the "criminal, diseased, rapist" Mexicans out. Yeah, of course this administration is preoccupied with nationalist xenophobia.
> nationalist xenophobia is the true most important priority for this administration
It is a little more complicated than that. It is what around 40% of the American population wants. (Then another 9.5% or so voted for Trump based on the price of eggs, the fact that the other candidate was a woman, and so on.)
That's worse, though. It implies that 40% of the "electorate" was (in some combination) either prevented from voting or not bothered enough by the difference between plausible electoral outcomes to actually try to influence said outcome.
Just because the people voted for Trump does not mean that the people don't want visa laws followed. Also, voting for someone does not give that someone carte blanche to do whatever the hell they please.
Can you point me to any resources on DSPy that don't make it look like magic, though? It was all the hype for a while and then everyone moved on from it.