Fast, scalable memory for modern AI.
A blazingly fast and Rust-native compiled memory architecture based on heuristic and algorithmic methods, with no LLM API calls or costs.
Benchmark results.
LongMemEval · LOCOMO · higher is better
Scores are self-published by each provider and have been reproduced internally.
Blazing Fast.
Fully Observable.
Vector math, statistical inference, and power-law decay. Not LLM calls. Cosine similarity drives novelty detection and semantic overlap. Beta posteriors gate procedural learning. Utility weights are refined by gradient descent after every interaction. Every step is logged and reconstructible.
Salience Filter
Every incoming event is scored for signal value before anything else. Low-salience events are dropped here and never reach storage.
Event Store
High-salience events are written as immutable records to the append-only event log. The full interaction history is always reconstructible from this store.
Memory Compiler
Five sequential passes transform raw events into structured memories: episode detection, semantic extraction, procedural recognition, identity scoring, entity consolidation. No LLM calls.
Compiled Stores
Compiled memories are written to four typed stores, each with distinct decay behavior. Episodic by causal chain. Semantic with confidence and conflict tracking. Procedural with Bayesian posteriors. Identity with drift modeling.
Retrieval Policy Engine
At recall time, the policy engine selects candidates by causal relevance to the current goal, not text similarity. Each memory type is queried differently.
Working Memory Controller
Retrieved candidates are ranked, deduplicated, and packed into a token budget. Contradictions are surfaced as explicit signals. The context window is assembled here.
Memory Utility Function
After each interaction, outcome signals update five learned utility weights. The system adjusts its own salience threshold and retrieval priorities based on what correlated with success.
How it compares.
Memory that is
goal oriented.
Enagram is not semantic search or vector similarity matching. It is designed to respond to the agent's goals and surface relevant facts, procedures, and identities from its memory compilation step.
Labs & Research.
Enagram is developed on top of research into cognitive and continuum memory architectures for long-horizon LLM agents. Every mechanism in the pipeline has a grounding in published research.
arXiv:2601.09913Continuum Memory Architectures for Long-Horizon LLM AgentsEVENT SEGMENTATION THEORY
Zacks et al. 2007
Episode boundaries detected at prediction error spikes, not arbitrary token counts.
ACT-R BASE-LEVEL LEARNING
Anderson 1983, 2004
Error and reward signals strengthen memory encoding at the moment of outcome.
PREDICTIVE PROCESSING
Friston 2010
Salience proportional to prediction error — novel inputs are weighted over repetition.
GLOBAL WORKSPACE THEORY
Dehaene et al. 2003
High-salience events bypass scoring and commit unconditionally to the event store.
PRICING
Simple, transparent pricing.
FREE
- —1 application
- —Shared compute — 250m CPU · 256Mi memory
- —1Gi storage
- —Data & analytics
- —Memory configuration
- —Community support
PRO
- —Unlimited applications
- —Dedicated compute — 1000m CPU · 1Gi memory
- —10Gi storage
- —Data & analytics
- —Memory configuration
- —Email support
ENTERPRISE
- —Custom resource allocation
- —On-premise deployment
- —Data & analytics
- —Memory configuration
- —SLA guarantees
- —Dedicated support
Start building
today.
Get started for free. Upgrade when you need more.