I've definitely noticed this anecdotally. Especially with Gemini Pro when provid...

zwaps · 2025-07-14T22:38:01 1752532681

Gemini loses coherence and reasoning ability well before the chat hits the context limitations, and according to this report, it is the best model on several dimensions.

Long story short: Context engineering is still king, RAG is not dead

deadbabe · 2025-07-15T00:19:43 1752538783

RAG was never going away, the people who say that are the same types who say software engineers will be totally replaced with AI.

LLMs will need RAG one way or another, you can hide it from the user, but it still must be there.

tvshtr · 2025-07-14T23:51:13 1752537073

Yep, it can decohere really badly with bigger context. It's not only context related though. Sometimes it can lose focus early on in a way that is impossible to get it back on track.

risyachka · 2025-07-14T23:05:05 1752534305

Yep. The easiest way to tell someone has no experience with LLMs is if they say “RAG is dead”

apwell23 · 2025-07-15T00:19:17 1752538757

> someone has no experience with LLMs

Thats 99% of coders. No need to gatekeep.

Xmd5a · 2025-07-15T09:26:08 1752571568

Gemini loses the notion of context the longer its context is: I often ask it to provide a summary of our discussion for the outside world and it will reference ideas or documents without introducing them, via anaphore, as if the outside world had knowledge of the context.

Inviz · 2025-07-15T01:50:35 1752544235

Cursor lifted "Start a new chat" limitation on gemini and i'm actually now enjoying keeping longer sessions within one window, becuase it's still very reasonable at recall, but doesnt need to restate everything each time

darepublic · 2025-07-15T17:25:46 1752600346

Can you elaborate on how prompts enhanced with rag avoid this context pollution? I don't understand why that would be

irskep · 2025-07-15T04:20:16 1752553216

"Compactions" are just reducing the transcript to a summary of the transcript, right? So it makes sense that it would get worse because the agent is literally losing information, but it wouldn't be due to context rot.

The thing that would signal context rot is when you approach the auto-compact threshold. Am I thinking about this right?

0x457 · 2025-07-15T17:03:21 1752599001

Yes, but on agentic workflows it's possible to do more intelligent compaction.

bayesianbot · 2025-07-15T00:51:48 1752540708

I feel like the optimal coding agent would do this automatically - collect and (sometimes) summarize the required parts of code, MCP responses, repo maps etc., then combine the results into a new message in a new 'chat' that would contain all the required parts and nothing else. It's basically what I already do with aider, and I feel the performance (in situations with a lot of context) is way better than any agentic / more automated workflow I've tried so far, but it is a lot of work.

OccamsMirror · 2025-07-15T06:28:23 1752560903

Claude Code tries, and it seems to be OK at it. It's hard to tell though and it definitely feels like sometimes you absolutely have to quit out and start again.

doctorhandshake · 2025-07-15T12:30:57 1752582657

Try using /clear instead of quitting. Doesn’t clear scrollback buffer but does clear context

gonzric1 · 2025-07-15T06:46:33 1752561993

Appmap's ai agent does this very well.

tough · 2025-07-14T22:36:51 1752532611

Have you tried NotebookLM which basically does this as an app on the bg (chunking and summarising many docs) and you can -chat- with the full corpus using RAG