Hermes Academy
Chapter 04

This chapter takes the session ledger, built-in memory, provider recall, session search, and context compression apart and looks at each one on its own.

Focus: hermes_state.py, memory_tool.py, memory_manager.py

The most important correction

If you say “Hermes memory” as if it names a single subsystem, you are already losing precision.

Hermes splits memory-like behavior into at least these layers:

  • session ledger
  • explicit built-in memory
  • external provider recall
  • session search
  • context compression
  • procedural memory through skills

Session ledger

hermes_state.py stores sessions and messages in SQLite with WAL and FTS5.

That layer answers:

  • what happened
  • what tool results existed
  • what prior sessions can be searched
  • how compressed sessions relate to earlier ones

It is better thought of as durable conversation accounting than semantic memory.
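A minimal sketch of such a ledger, using SQLite with WAL and FTS5 as the text describes. The table and column names here (sessions, messages, messages_fts, parent_session_id) are illustrative assumptions, not the real hermes_state.py schema:

```python
import sqlite3

# Hypothetical schema; only the WAL + FTS5 mechanics match the text.
conn = sqlite3.connect("hermes_state.db")
conn.execute("PRAGMA journal_mode=WAL")  # readers don't block the writer

conn.executescript("""
CREATE TABLE IF NOT EXISTS sessions (
    id INTEGER PRIMARY KEY,
    started_at TEXT,
    parent_session_id INTEGER  -- links a compressed session to its source
);
CREATE TABLE IF NOT EXISTS messages (
    id INTEGER PRIMARY KEY,
    session_id INTEGER REFERENCES sessions(id),
    role TEXT,     -- user / assistant / tool
    content TEXT
);
CREATE VIRTUAL TABLE IF NOT EXISTS messages_fts
    USING fts5(content, content='messages', content_rowid='id');
""")

# Record a message and keep the FTS index in sync.
sid = conn.execute(
    "INSERT INTO sessions (started_at) VALUES (datetime('now'))"
).lastrowid
mid = conn.execute(
    "INSERT INTO messages (session_id, role, content) VALUES (?, ?, ?)",
    (sid, "user", "please check the deploy logs"),
).lastrowid
conn.execute(
    "INSERT INTO messages_fts (rowid, content) "
    "SELECT id, content FROM messages WHERE id = ?",
    (mid,),
)
conn.commit()

# Search prior sessions by keyword.
rows = conn.execute(
    "SELECT m.session_id, m.content FROM messages m "
    "WHERE m.id IN (SELECT rowid FROM messages_fts WHERE messages_fts MATCH ?)",
    ("deploy",),
).fetchall()
```

The accounting framing shows up in the schema: rows answer who said what in which session, and a parent pointer lets a compressed session name the session it summarizes.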

hermes-agent › hermes_state.py: the durable ledger of sessions and messages. Treat it as the truth source for what happened.

Built-in explicit memory

Hermes also has small, explicit files:

  • MEMORY.md
  • USER.md

These are not giant stores. They are bounded, curated facts intended to stay close to the prompt.

The clever part is the frozen snapshot pattern:

  • load from disk at session start
  • build a prompt snapshot
  • allow writes during the session
  • keep the current prompt snapshot unchanged

That preserves stable prompt behavior and good cache economics.
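The pattern above can be sketched in a few lines. The class and method names here are hypothetical; only the behavior (read once at session start, write to disk freely, never refresh the in-prompt copy) is from the text:

```python
from pathlib import Path

class MemorySnapshot:
    """Frozen-snapshot pattern: disk is mutable, the prompt copy is not."""

    def __init__(self, memory_path="MEMORY.md", user_path="USER.md"):
        self.memory_path = Path(memory_path)
        self.user_path = Path(user_path)
        # Read once at session start; this frozen copy goes into the prompt.
        self.prompt_snapshot = self._read_all()

    def _read_all(self):
        parts = []
        for p in (self.memory_path, self.user_path):
            if p.exists():
                parts.append(p.read_text())
        return "\n\n".join(parts)

    def append_fact(self, fact):
        # Writes land on disk during the session...
        with self.memory_path.open("a") as f:
            f.write(f"\n- {fact}")
        # ...but prompt_snapshot is deliberately NOT refreshed, so the
        # prompt prefix stays byte-stable and cache-friendly.

    def prompt_block(self):
        return self.prompt_snapshot
```

Because the prompt prefix never changes mid-session, provider-side prompt caching keeps hitting, and the model's behavior does not drift as facts are appended. New facts take effect at the next session start.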

Provider recall

The provider layer adds optional long-term recall systems like Honcho or Supermemory.

MemoryManager deliberately keeps:

  • the built-in provider always present
  • at most one external provider active

That one-external-provider rule prevents a lot of mess:

  • duplicated schemas
  • conflicting recall semantics
  • multiple backends trying to own truth at once
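The invariant is small enough to sketch directly. These class and method names are assumptions, not the real memory_manager.py API; only the "builtin always present, at most one external" rule comes from the text:

```python
class BuiltinProvider:
    """Stand-in for Hermes's built-in memory layer."""
    name = "builtin"

class MemoryManager:
    def __init__(self):
        self.builtin = BuiltinProvider()   # always present
        self.external = None               # at most one (e.g. Honcho OR Supermemory)

    def set_external(self, provider):
        # Refuse a second external backend rather than letting two
        # recall systems fight over truth.
        if self.external is not None:
            raise ValueError(
                f"external provider {self.external.name!r} already active; "
                "deactivate it before enabling another"
            )
        self.external = provider

    def providers(self):
        return [self.builtin] + ([self.external] if self.external else [])
```

Raising instead of silently swapping providers is the design choice that keeps recall semantics predictable: switching backends has to be an explicit, two-step act.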

When the agent wants old context, it does not dump entire transcripts into the prompt.

Instead Hermes:

  1. searches via FTS
  2. groups matches by session
  3. extracts useful transcript windows
  4. summarizes with a cheaper model

That gives the main model a focused recap rather than a transcript flood.
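The four steps above can be sketched as one pipeline. The search, window-loading, and summarization functions are stubbed parameters here, since the real implementations live in hermes_state.py and the provider layer:

```python
from collections import defaultdict

def recall(fts_search, load_window, summarize, query, window=3):
    """Hypothetical recall pipeline: search -> group -> window -> recap."""
    # 1. search via FTS: yields (session_id, message_id) hits
    hits = fts_search(query)

    # 2. group matches by session
    by_session = defaultdict(list)
    for session_id, message_id in hits:
        by_session[session_id].append(message_id)

    # 3. extract a transcript window around each hit
    windows = {
        sid: [load_window(sid, mid, window) for mid in mids]
        for sid, mids in by_session.items()
    }

    # 4. summarize with a cheaper model into a focused recap
    return summarize(query, windows)
```

The shape matters more than the stubs: only small transcript windows ever reach the summarizer, and only the summarizer's recap ever reaches the main model.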

Context compression

Compression is memory-adjacent but conceptually separate. It deals with current working context, not just long-term recall.

Hermes can:

  • prune old tool outputs
  • protect the head of the conversation
  • protect the recent tail
  • summarize the middle as a handoff document

The summary is explicitly framed as reference, not as fresh instructions. Provenance matters.
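A minimal sketch of head/tail-protected compression. The thresholds and the handoff framing string are illustrative assumptions, not Hermes defaults:

```python
def compress(messages, summarize, keep_head=2, keep_tail=4):
    """Protect the head and tail; replace the middle with a handoff summary."""
    if len(messages) <= keep_head + keep_tail:
        return messages  # nothing worth compressing

    head = messages[:keep_head]           # protect the opening context
    tail = messages[-keep_tail:]          # protect the recent turns
    middle = messages[keep_head:-keep_tail]

    handoff = {
        "role": "user",
        # Provenance framing: reference material, not fresh instructions.
        "content": "[Handoff summary of earlier turns, for reference only]\n"
                   + summarize(middle),
    }
    return head + [handoff] + tail
```

The bracketed framing line is the provenance marker: the model is told it is reading a recap of the past, so the summary cannot masquerade as a new user request.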

The deeper lesson

Hermes is strong here because it refuses to pretend that:

  • durable history
  • user profile
  • semantic recall
  • context shrinking
  • procedural know-how

are all the same thing.

That layered view is one of the clearest signs that Hermes was built by people who have already felt the pain of overloading the word “memory.”