Observational Memory (OM) is a two-stage compression system that lets agents hold conversations of effectively unlimited length. Once the accumulated messages exceed a token threshold, an Observer agent extracts structured observations from them. When the observations themselves grow too large, a Reflector agent compresses them further. The main agent (the "Actor") only ever sees two things: the compressed observations and the recent messages that have not yet been observed.
Three agents are involved:
- Actor — the main agent the user talks to
- Observer — extracts observations from raw messages (default: gemini-2.5-flash, temp 0.3)
- Reflector — compresses observations when they get too large (default: gemini-2.5-flash, temp 0)
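The two-stage flow described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not OM's actual implementation: the class name `ObservationalMemory`, its fields, the word-count `count_tokens` helper, and the threshold values are all hypothetical, and the Observer and Reflector are replaced by stub functions instead of real model calls. The sketch also simplifies by observing every pending message at once when the threshold is crossed.

```python
from dataclasses import dataclass, field
from typing import Callable, List


def count_tokens(text: str) -> int:
    """Crude stand-in for a real tokenizer: one token per whitespace-separated word."""
    return len(text.split())


@dataclass
class ObservationalMemory:
    # Hypothetical interface: both agents are modeled as plain callables.
    observe: Callable[[List[str]], str]   # Observer: raw messages -> one observation
    reflect: Callable[[List[str]], str]   # Reflector: observations -> one condensed observation
    message_threshold: int = 50           # assumed token budget for unobserved messages
    observation_threshold: int = 30       # assumed token budget for observations
    observations: List[str] = field(default_factory=list)
    unobserved: List[str] = field(default_factory=list)

    def add_message(self, message: str) -> None:
        self.unobserved.append(message)

        # Stage 1: too many raw message tokens, so the Observer extracts an observation.
        if sum(count_tokens(m) for m in self.unobserved) > self.message_threshold:
            self.observations.append(self.observe(self.unobserved))
            self.unobserved = []

        # Stage 2: too many observation tokens, so the Reflector compresses them.
        if sum(count_tokens(o) for o in self.observations) > self.observation_threshold:
            self.observations = [self.reflect(self.observations)]

    def actor_context(self) -> List[str]:
        """What the Actor sees: compressed observations plus unobserved messages."""
        return self.observations + self.unobserved


# Stub agents used only for illustration; real ones would call a model.
def fake_observer(messages: List[str]) -> str:
    return f"obs({len(messages)} msgs)"


def fake_reflector(observations: List[str]) -> str:
    return f"reflection({len(observations)} obs)"
```

With small thresholds, feeding messages through `add_message` shows the raw messages being replaced by observations, and the observations later collapsing into a single reflection, while `actor_context()` stays bounded.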