A TOML-based configuration format and Node.js library for structured AI agent memory.
Six layers. Explicit decay. Zero amnesia. Model agnostic.
architecture
Each layer serves a distinct cognitive role. Memories flow through layers as they age, compress, and consolidate.
What the agent is thinking about right now. Short-lived, high-throughput context that drives the current inference window. Expires in seconds to minutes and evicts old entries via LRU when capacity is reached.
[memory.buffer]
ttl = "5m"
capacity = 1000
strategy = "lru"
priority = 6
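Under this config, BUFFER behaves like a TTL-bounded LRU cache. A minimal sketch of that eviction behavior, assuming a simple in-memory map (`LruBuffer` is illustrative, not the library's internal class):

```typescript
// Illustrative sketch only -- not ENGRAM's internal implementation.
// Entries expire after ttlMs; at capacity, the least-recently-used
// entry is evicted, mirroring the ttl/capacity/strategy keys above.
class LruBuffer<T> {
  private entries = new Map<string, { value: T; storedAt: number }>();
  constructor(private capacity: number, private ttlMs: number) {}

  set(key: string, value: T, now = Date.now()): void {
    this.entries.delete(key); // re-insert so the key counts as most recent
    if (this.entries.size >= this.capacity) {
      // Map iterates keys in insertion order, so the first key is the LRU entry
      const lru = this.entries.keys().next().value as string;
      this.entries.delete(lru);
    }
    this.entries.set(key, { value, storedAt: now });
  }

  get(key: string, now = Date.now()): T | undefined {
    const entry = this.entries.get(key);
    if (!entry) return undefined;
    if (now - entry.storedAt > this.ttlMs) {
      this.entries.delete(key); // expired past its TTL
      return undefined;
    }
    this.entries.delete(key); // refresh recency on access
    this.entries.set(key, entry);
    return entry.value;
  }
}
```

JavaScript's `Map` guarantees insertion-order iteration, which is what makes the first key the least recently used once every access re-inserts its entry.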
A chronological record of what happened and when. Every entry carries a timestamp and decays according to a configurable half-life — older events grow dimmer without being erased outright.
[memory.episode]
half_life = "2h"
decay = "exponential"
max_entries = 500
priority = 5
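The half-life decay declared above follows the standard exponential form: a memory's retrieval weight halves every `half_life` interval. A sketch (`decayWeight` is a hypothetical helper, not part of the library API):

```typescript
// weight = 2^(-age / halfLife): halves every half-life, never reaches zero.
// Hypothetical helper, not an ENGRAM export.
function decayWeight(ageSeconds: number, halfLifeSeconds: number): number {
  return Math.pow(2, -ageSeconds / halfLifeSeconds);
}

// With half_life = "2h" (7200 s):
decayWeight(0, 7200);     // 1.0  -- fresh entry, full weight
decayWeight(7200, 7200);  // 0.5  -- one half-life old
decayWeight(14400, 7200); // 0.25 -- two half-lives old
```

Because the weight falls smoothly rather than hitting a cutoff, old events grow dimmer instead of disappearing all at once.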
Structured facts and relational knowledge stored as a persistent, updatable graph. What the agent knows about the world — independent of when it learned it. Nodes merge and evolve over time.
[memory.graph]
persistent = true
storage = "json"
strategy = "merge"
priority = 4
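The `merge` strategy means a new fact about an existing node updates it in place rather than appending a duplicate. A sketch under an assumed node shape (the library's actual node format is not shown here):

```typescript
// Hypothetical node shape -- illustrative only.
interface GraphNode {
  id: string;
  facts: Record<string, string>;
  updatedAt: number;
}

// Merge semantics: newer facts overwrite matching keys, untouched facts
// survive, and the node's timestamp advances.
function mergeNode(graph: Map<string, GraphNode>, incoming: GraphNode): void {
  const existing = graph.get(incoming.id);
  if (!existing) {
    graph.set(incoming.id, incoming);
    return;
  }
  existing.facts = { ...existing.facts, ...incoming.facts };
  existing.updatedAt = Math.max(existing.updatedAt, incoming.updatedAt);
}
```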
Encoded workflows, task patterns, and learned behaviors. The agent's procedural memory — how to accomplish goals, not just what goals exist. Reinforced on success, versioned across updates.
[memory.skill]
persistent = true
versioned = true
reinforce_on = "success"
priority = 3
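`reinforce_on = "success"` implies only successful executions strengthen a skill, while `versioned = true` keeps revisions distinct. A toy sketch with an assumed skill shape:

```typescript
// Hypothetical skill shape -- the real record format is not shown here.
interface Skill {
  name: string;
  strength: number; // bumped on success, per reinforce_on = "success"
  version: number;  // bumped on content updates, per versioned = true
}

function reinforce(skill: Skill, outcome: 'success' | 'failure'): Skill {
  return outcome === 'success'
    ? { ...skill, strength: skill.strength + 1 }
    : skill; // failures leave the skill untouched
}

function revise(skill: Skill): Skill {
  return { ...skill, version: skill.version + 1 };
}
```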
Lossy but searchable distillations of memories that have aged out of EPISODE. Not deleted — compressed. The residue of experience that shapes behavior without consuming context budget.
[memory.residue]
compression = "semantic"
ratio = 0.05
source_layers = ["episode"]
priority = 2
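`ratio = 0.05` means a distilled entry lands in RESIDUE at roughly 5% of its source size. Real semantic compression would run through a model; this toy stand-in only enforces the size budget, to show what the ratio controls:

```typescript
// Toy stand-in for semantic compression -- it illustrates only the size
// budget that ratio = 0.05 implies, not how the distillation is done.
function compressToResidue(text: string, ratio = 0.05): string {
  const budget = Math.max(1, Math.floor(text.length * ratio));
  return text.slice(0, budget);
}
```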
Write-once memories that define the agent's fundamental identity, values, and constraints. CORE never decays, is never overwritten, and always resolves first. The bedrock that holds everything else steady.
[memory.core]
immutable = true
ttl = "forever"
priority = 1
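The `priority` keys across the six layer tables form a total order: lower number wins, so CORE (1) always beats BUFFER (6). A sketch of that resolution rule (`resolveConflict` is hypothetical, not a library export):

```typescript
// Priority values as declared in the layer configs above: lower wins.
const LAYER_PRIORITY: Record<string, number> = {
  core: 1, residue: 2, skill: 3, graph: 4, episode: 5, buffer: 6,
};

// When two layers disagree, the higher-priority (lower-numbered) layer's
// memory is kept. Hypothetical helper, not part of the ENGRAM API.
function resolveConflict<T extends { layer: string }>(a: T, b: T): T {
  return LAYER_PRIORITY[a.layer] <= LAYER_PRIORITY[b.layer] ? a : b;
}
```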
comparison
Retrieval is not memory. ENGRAM gives agents structured, layered, decaying memory — not just a vector dump.
typical vector store
All memories treated as equal vectors — no hierarchy, no priority, no structure
Embeddings carry no intrinsic timestamp — recency is bolted on as metadata at best
Old context survives indefinitely or vanishes entirely — no graceful aging
Locked to cosine distance — semantically distant but critically important memories get lost
Every inference starts fresh — no continuity, no accumulated understanding
No way to declare that some memories must always win conflicts over others
Retrieval quality degrades as the vector store grows — signal drowns in noise
Opaque internals — memory state is unreadable without embedding model access
engram
BUFFER, EPISODE, GRAPH, SKILL, RESIDUE, CORE — each with a defined cognitive role
EPISODE layer carries full timestamps and half-life decay from the moment of storage
TTL, half-life, and decay strategy are declared per layer in plain TOML
Explicit conflict resolution — CORE always wins, BUFFER is lowest priority
Memory accumulates across sessions — agents build genuine continuity over time
BUFFER → EPISODE → GRAPH consolidation moves memories up the stack automatically
Decayed EPISODE entries compress into RESIDUE — searchable, lossy, never truly lost
Full memory architecture is a human-readable file you can read, commit, and review in a PR
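The BUFFER → EPISODE → GRAPH consolidation flow can be sketched as one pass over three in-memory layers; the `isStableFact` predicate stands in for whatever promotion criteria the library actually applies:

```typescript
type Layers = { buffer: string[]; episode: string[]; graph: string[] };

// One consolidation pass: buffer entries age into EPISODE, and episodes
// that qualify as stable facts promote into GRAPH. The predicate is a
// placeholder -- real criteria would weigh decay, importance, and semantics.
function consolidate(layers: Layers, isStableFact: (e: string) => boolean): Layers {
  const promoted = layers.episode.filter(isStableFact);
  const remaining = layers.episode.filter(e => !isStableFact(e));
  return {
    buffer: [],                                // drained upward
    episode: [...remaining, ...layers.buffer], // aged buffer entries arrive
    graph: [...layers.graph, ...promoted],     // stable knowledge accumulates
  };
}
```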
design principles
ENGRAM is opinionated by design. These four principles drive every decision in the library and the format.
A slot-based memory architecture beats a bag of vectors. Know what each memory means before you store it. ENGRAM gives every memory a declared type, layer, and role — nothing floats anonymously in latent space.
Memories don't vanish — they compress into RESIDUE and consolidate into GRAPH. Nothing is truly lost, just transformed. Forgetting should be intentional, gradual, and auditable.
Not all memories are equal. CORE never changes. BUFFER expires in minutes. The architecture reflects cognitive reality — cramming everything into one context window is not a memory strategy.
Your agent's memory should be readable by humans. Declare it in a file, commit it to git, review it in a PR. If you can't explain your memory architecture in a text editor, you don't control it.
usage
The ENGRAM library gives you a clean TypeScript/JavaScript API over the full layer stack. One config file. Six layers. Zero boilerplate.
# engram.toml
[agent]
id = "research-assistant"
version = "0.1.0"
[memory.buffer]
ttl = "5m"
capacity = 1000
strategy = "lru"
[memory.episode]
half_life = "2h"
max_entries = 500
[memory.graph]
persistent = true
storage = "json"
[memory.core]
immutable = true
// npm install @MateoKnox/engram
import { EngramEngine } from '@MateoKnox/engram';
const engine = new EngramEngine('./engram.toml');
await engine.init();
// Store a memory in the episode layer
await engine.store('episode', 'User asked about photosynthesis', {
  tags: ['biology', 'user-query'],
  importance: 0.8
});
// Recall across all layers
const memories = await engine.recall('photosynthesis', {
  layers: ['graph', 'episode', 'core'],
  limit: 5
});
// Run decay pass
await engine.decay();
// Consolidate buffer → episode → graph
await engine.consolidate();
Install the library, drop in a config file, and give your agent a memory that actually works across sessions.