I built a few multi-agent systems and went down a rabbit hole before reaching an important conclusion: from the perspective of the LLM, the prompt/context is the only thing that ever matters. Everything about how your agent behaves ultimately boils down to it.
I had a bunch of fancy stuff like agents collaborating by passing messages and interpreting them with their own prompts and function calls. Then I realized I could collapse all of my "agents" into one dynamic prompt that tracks state in a stupid simple text region. Passing messages around amounted to playing in very expensive traffic at the end of the day.
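A minimal sketch of what that collapse can look like, assuming a generic complete() call standing in for whatever chat-completion API you use (the STATE block and the <state> tag convention are my own, not from the original setup):

    # Minimal sketch: one dynamic prompt with a plain-text state region,
    # instead of separate agents passing messages to each other.
    # `complete(prompt)` is a placeholder for your chat-completion call.

    TEMPLATE = """You are a single worker handling all roles.

    STATE (rewrite this block in full in your reply, inside <state> tags):
    {state}

    TASK:
    {task}

    Reply with <state>...</state> followed by your next action."""

    def step(complete, task: str, state: str) -> str:
        reply = complete(TEMPLATE.format(state=state, task=task))
        # Pull the rewritten state block back out; everything else is the action.
        if "<state>" in reply and "</state>" in reply:
            state = reply.split("<state>", 1)[1].split("</state>", 1)[0].strip()
        return state

    # Loop step() until the state says the task is done; no message bus,
    # no per-agent histories -- the state text *is* the system's memory.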
This is ultimately about moving information, and spinning up an entire matrix of "agents" to process a stream of info from A to B seems quite wasteful when many clear alternatives exist.
If we are seeking emergence, then perhaps the multi-agent mental model still fits better. But for practical, targeted solutions, I think it's a huge distraction.
The best analogy I can think of is Leonard's method in the film "Memento". If you want your agent to accomplish something without persisting a long chat history, and instead use the agent to reorganize and rewrite the prompt itself, that is essentially his approach. Due to his condition, Leonard cannot form new memories and struggles to recall anything that happens after his injury. He knows this, so he uses tattoos to record facts "between sessions". Each chat completion of the LLM is like one of Leonard's sessions, and the prompt, maintained with the help of the agent, persists across completions the way his tattoos, notes, and photos do.
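Concretely, the "tattoo" version of that loop might look like the sketch below; the NOTES/DONE markers and the rewrite instruction are assumptions for illustration, not a fixed recipe:

    # Memento-style loop: no chat history is kept. Each completion sees only
    # the current notes and must hand back an updated set of notes, the same
    # way Leonard's tattoos carry facts between sessions.

    def memento_loop(complete, goal: str, notes: str = "", max_steps: int = 10) -> str:
        for _ in range(max_steps):
            prompt = (
                f"Goal: {goal}\n\n"
                f"Your notes from previous sessions (you remember nothing else):\n{notes}\n\n"
                "Do one step of work. Then output NOTES: followed by a rewritten,\n"
                "condensed version of the notes for your next session.\n"
                "If the goal is complete, output DONE: followed by the result."
            )
            reply = complete(prompt)
            if "DONE:" in reply:
                return reply.split("DONE:", 1)[1].strip()
            if "NOTES:" in reply:
                notes = reply.split("NOTES:", 1)[1].strip()
        return notes  # ran out of steps; return whatever was recorded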
That's true if you're passing messages between identical models. There's still a question of whether different models trained for different tasks would beat a single multipurpose model, though. My gut feeling is that eventually multipurpose models will win, because you don't pay the embedded cost of relearning what syntactic structure is in every specialist, but for a given training time and number of weights it's not clear that holds today.
Yeah, same principle. If you're passing messages between things that will react exactly the same to the same prompt, there's not a lot of point (unless the parallelism is important). If you've got fine-tunes, the whole point is that they will be better at some questions than the baseline.
Mind you, there's another idea in there: mixture of experts implemented by deciding which fine-tune to load based on the prompt itself... I'm sure that's been looked at.
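As a rough sketch of that routing idea (the classifier prompt, the category labels, the model names, and the complete(model=..., prompt=...) call signature are all made up for illustration):

    # Sketch of "mixture of experts by fine-tune selection": a cheap
    # classification pass decides which fine-tuned model handles the prompt.
    # The model names and categories here are placeholders, not real endpoints.

    FINETUNES = {
        "code": "base-model-code-ft",
        "math": "base-model-math-ft",
        "general": "base-model",
    }

    def route(complete, prompt: str) -> str:
        labels = ", ".join(FINETUNES)
        label = complete(
            model="base-model",
            prompt=f"Classify this request as one of [{labels}]. Answer with one word:\n{prompt}",
        ).strip().lower()
        model = FINETUNES.get(label, FINETUNES["general"])
        return complete(model=model, prompt=prompt)

    # Unlike a trained router inside a real MoE layer, this routing happens
    # once per request at the model level, not per token inside the network.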
Maybe the LLM needs something like Dante's Divina Commedia, where previous instances describe in condensed form why a previous prompt's conclusions failed, so it can successfully navigate around the trained-in local minima. A diary of failures to keep track of how to reach success.
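A tiny sketch of that "diary of failures" idea; the summarization prompt and the external check() verifier are my own guesses at a format, not anything established:

    # Sketch of a failure diary: after each failed attempt, ask the model to
    # condense *why* it failed, and prepend that diary to the next attempt so
    # later instances can steer around the same dead ends.

    def attempt_with_diary(complete, check, task: str, max_attempts: int = 5) -> str:
        diary = []  # condensed post-mortems of earlier attempts
        for _ in range(max_attempts):
            history = "\n".join(f"- {entry}" for entry in diary) or "- (none yet)"
            answer = complete(
                f"Task: {task}\n"
                f"Lessons from failed attempts:\n{history}\n"
                "Avoid repeating those mistakes. Give your answer."
            )
            if check(answer):  # external verifier: tests, a grader, a human, etc.
                return answer
            diary.append(complete(
                f"This answer to '{task}' was wrong:\n{answer}\n"
                "In one sentence, state why the approach failed."
            ).strip())
        raise RuntimeError("no successful attempt; diary:\n" + "\n".join(diary))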