McpEngramMemory.Core 0.4.1

dotnet add package McpEngramMemory.Core --version 0.4.1

MCP Engram Memory

A cognitive memory MCP server that provides an LLM with namespace-isolated vector storage, k-nearest-neighbor search (cosine similarity), a knowledge graph, semantic clustering, lifecycle management with activation energy decay, and physics-based re-ranking. Data is persisted to disk as JSON with debounced writes.

Quickstart

Option 1 — dotnet (recommended)

git clone https://github.com/wyckit/mcp-engram-memory.git
cd mcp-engram-memory
dotnet build

Add to your MCP client config (Claude Code, Copilot, etc.):

{
  "mcpServers": {
    "engram-memory": {
      "command": "dotnet",
      "args": ["run", "--project", "/path/to/mcp-engram-memory/src/McpEngramMemory"]
    }
  }
}

Option 2 — Docker

docker build -t mcp-engram-memory .
docker run -i -v memory-data:/app/data mcp-engram-memory

Option 3 — NuGet (embed in your app)

dotnet add package McpEngramMemory.Core --version 0.4.1

That's it. The server exposes 38 MCP tools. To reduce tool count, set MEMORY_TOOL_PROFILE:

Profile Tools Use case
minimal 5 Simple store/search — drop-in memory for any agent
standard 18 Adds graph, lifecycle, clustering, intelligence
full 38 Everything including expert routing, debate, benchmarks (default)
{
  "env": { "MEMORY_TOOL_PROFILE": "minimal" }
}

See examples/ for ready-to-use config files.

Architecture

graph TD
    subgraph MCP["MCP Server (stdio)"]
        Tools["11 Tool Classes<br/>38 MCP Tools"]
    end

    Tools --> CI["CognitiveIndex<br/><i>Thin facade: CRUD, locking, limits</i>"]

    CI --> RE["Retrieval"]
    CI --> GR["Graph"]
    CI --> LC["Lifecycle"]
    CI --> IN["Intelligence"]
    CI --> EX["Experts"]
    CI --> EV["Evaluation"]

    subgraph RE["Retrieval Engine"]
        VS["VectorSearchEngine<br/><i>Two-stage Int8→FP32</i>"]
        HS["HybridSearchEngine<br/><i>BM25 + Vector RRF</i>"]
        HW["HnswIndex<br/><i>O(log N) ANN</i>"]
        QE["QueryExpander"]
        TR["TokenReranker"]
        VQ["VectorQuantizer<br/><i>SIMD Int8</i>"]
    end

    subgraph GR["Knowledge Graph"]
        KG["KnowledgeGraph<br/><i>Directed edges, BFS</i>"]
    end

    subgraph LC["Lifecycle Engine"]
        LE["LifecycleEngine<br/><i>Decay, state transitions</i>"]
        PE["PhysicsEngine<br/><i>Gravitational re-ranking</i>"]
    end

    subgraph IN["Intelligence"]
        CM["ClusterManager"]
        AS["AccretionScanner<br/><i>DBSCAN</i>"]
        DD["DuplicateDetector"]
    end

    subgraph EX["Expert Routing"]
        ED["ExpertDispatcher"]
        DM["DebateSessionManager"]
    end

    subgraph EV["Evaluation"]
        BR["BenchmarkRunner<br/><i>MRR, nDCG, Recall@K</i>"]
        MC["MetricsCollector<br/><i>P50/P95/P99</i>"]
    end

    CI --> NS["NamespaceStore<br/><i>Lazy loading, partitioned</i>"]
    NS --> SP["Storage Provider"]

    subgraph SP["Storage"]
        PM["PersistenceManager<br/><i>JSON + SHA-256 checksums</i>"]
        SQ["SqliteStorageProvider<br/><i>WAL mode</i>"]
    end

    NS --> EMB["OnnxEmbeddingService<br/><i>bge-micro-v2, 384-dim</i>"]

    subgraph BG["Background Services"]
        BG1["EmbeddingWarmup<br/><i>startup</i>"]
        BG2["DecayService<br/><i>every 15 min</i>"]
        BG3["AccretionService<br/><i>every 30 min</i>"]
    end

The Core Memory Loop

INGEST → INDEX → RETRIEVE → REINFORCE → DECAY → SUMMARIZE/COLLAPSE
   │                  │          │          │              │
   └── store_memory   │   memory_feedback  │    collapse_cluster
       (embed+upsert) │   (agent feedback) │    (DBSCAN → summary)
                      └── search_memory    └── decay_cycle
                          (k-NN/hybrid)    (activation energy)

Memories move through lifecycle states based on usage:

STM (short-term) ──promote──→ LTM (long-term) ──decay──→ Archived
                   ←─────────────────────────────────── deep_recall
                              (auto-resurrect if score ≥ 0.7)

Project Structure

The solution is split into two projects:

Project Type Description
McpEngramMemory Executable MCP server with stdio transport — register this in your MCP client
McpEngramMemory.Core NuGet Library Core engine (vector index, graph, clustering, lifecycle, persistence) — use this to embed the memory engine in your own application
src/
  McpEngramMemory/              # MCP server (Program.cs + Tool classes)
  McpEngramMemory.Core/         # Core library
    Models/                     # CognitiveEntry, SearchResults, MemoryLimitsConfig, etc.
    Services/
      CognitiveIndex.cs         # Thin facade: CRUD, locking, delegates to engines below
      NamespaceStore.cs         # Namespace-partitioned storage with lazy loading
      PhysicsEngine.cs          # Gravitational force re-ranking
      Retrieval/                # Search pipeline
        VectorMath.cs           #   SIMD-accelerated dot product & norm
        VectorSearchEngine.cs   #   Two-stage Int8 screening + FP32 reranking
        HnswIndex.cs            #   HNSW approximate nearest neighbor index
        HybridSearchEngine.cs   #   BM25 + vector RRF fusion
        BM25Index.cs            #   Keyword search index
        QueryExpander.cs        #   IDF-based query term expansion
        TokenReranker.cs        #   Token-overlap reranker (implements IReranker)
        VectorQuantizer.cs      #   Int8 scalar quantization
        IReranker.cs            #   Pluggable reranker interface
      Graph/
        KnowledgeGraph.cs       #   Directed graph with adjacency lists
      Intelligence/
        ClusterManager.cs       #   Semantic cluster CRUD + centroid computation
        AccretionScanner.cs     #   DBSCAN density scanning + reversible collapse
        DuplicateDetector.cs    #   Pairwise cosine similarity duplicate detection
        AccretionBackgroundService.cs
      Lifecycle/
        LifecycleEngine.cs      #   Decay, state transitions, deep recall
        DecayBackgroundService.cs
      Experts/
        ExpertDispatcher.cs     #   Semantic routing to expert namespaces
        DebateSessionManager.cs #   Debate session state + alias mapping
      Evaluation/
        BenchmarkRunner.cs      #   IR quality benchmarks
        MetricsCollector.cs     #   Operational metrics + percentiles
      Storage/
        IStorageProvider.cs     #   Storage abstraction interface
        PersistenceManager.cs   #   JSON file backend with debounced writes
        SqliteStorageProvider.cs #   SQLite backend with WAL mode
tests/
  McpEngramMemory.Tests/        # xUnit tests (458 tests)
benchmarks/
  baseline-v1.json              # IR quality baseline (MRR 1.0, nDCG@5 0.938, Recall@5 0.867)
  baseline-paraphrase-v1.json
  baseline-multihop-v1.json
  baseline-scale-v1.json
  ideas/                        # Benchmark proposals and analysis

NuGet Package

The core engine is available as a NuGet package for use in your own .NET applications.

dotnet add package McpEngramMemory.Core --version 0.4.1

Library Usage

using McpEngramMemory.Core.Models;
using McpEngramMemory.Core.Services;
using McpEngramMemory.Core.Services.Graph;
using McpEngramMemory.Core.Services.Intelligence;
using McpEngramMemory.Core.Services.Lifecycle;
using McpEngramMemory.Core.Services.Storage;

// Create services
var persistence = new PersistenceManager();
var embedding = new OnnxEmbeddingService();
var index = new CognitiveIndex(persistence);
var graph = new KnowledgeGraph(persistence, index);
var clusters = new ClusterManager(index, persistence);
var lifecycle = new LifecycleEngine(index, persistence);

// Store a memory
var vector = embedding.Embed("The capital of France is Paris");
var entry = new CognitiveEntry("fact-1", vector, "default", "The capital of France is Paris", "facts");
index.Upsert(entry);

// Search by text
var queryVector = embedding.Embed("French capital");
var results = index.Search(queryVector, "default", k: 5);
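
Under the hood, Search is a k-nearest-neighbor query over cosine similarity. As a language-neutral sketch of that core operation (helper names here are illustrative, not the library's API):

```python
import math

def cosine(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

def knn_search(query, entries, k=5):
    # entries: list of (id, vector) pairs; returns top-k (id, score) by similarity
    scored = [(eid, cosine(query, vec)) for eid, vec in entries]
    scored.sort(key=lambda t: t[1], reverse=True)
    return scored[:k]
```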

Tech Stack

MCP Tools (38 total)

Core Memory (3 tools)

Tool Description
store_memory Store a vector embedding with text, category, and optional metadata. Defaults to STM lifecycle state. Warns if near-duplicates are detected.
search_memory k-NN search within a namespace with optional lifecycle/category filtering, summary-first mode, physics-based re-ranking, and explain mode for full retrieval diagnostics.
delete_memory Remove a memory entry by ID. Cascades to remove associated graph edges and cluster memberships.

Knowledge Graph (4 tools)

Tool Description
link_memories Create a directed edge between two entries with a relation type and weight. cross_reference auto-creates bidirectional edges.
unlink_memories Remove edges between entries, optionally filtered by relation type.
get_neighbors Get directly connected entries with edges. Supports direction filtering (outgoing/incoming/both).
traverse_graph Multi-hop BFS traversal with configurable depth (max 5), relation filter, minimum weight, and max results.

Supported relation types: parent_child, cross_reference, similar_to, contradicts, elaborates, depends_on, custom.
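
traverse_graph's multi-hop BFS can be sketched generically (the adjacency-list shape and return format below are illustrative, not the tool's actual schema):

```python
from collections import deque

def traverse(adj, start, max_depth=5, relation=None):
    # adj: {node: [(neighbor, relation, weight), ...]} -- directed edges
    seen = {start}
    results = []
    queue = deque([(start, 0)])
    while queue:
        node, depth = queue.popleft()
        if depth == max_depth:
            continue                      # depth cap (tool maximum is 5)
        for nbr, rel, weight in adj.get(node, []):
            if relation is not None and rel != relation:
                continue                  # optional relation-type filter
            if nbr not in seen:
                seen.add(nbr)
                results.append((nbr, depth + 1, rel, weight))
                queue.append((nbr, depth + 1))
    return results
```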

Semantic Clustering (5 tools)

Tool Description
create_cluster Create a named cluster from member entry IDs. Centroid is computed automatically.
update_cluster Add/remove members and update the label. Centroid is recomputed.
store_cluster_summary Store an LLM-generated summary as a searchable entry linked to the cluster.
get_cluster Retrieve full cluster details including members and summary info.
list_clusters List all clusters in a namespace with summary status.
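
The automatically computed centroid is the element-wise mean of the member vectors (a sketch; the engine may additionally normalize):

```python
def centroid(vectors):
    # Element-wise mean of member vectors; recomputed when membership changes
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]
```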

Lifecycle Management (5 tools)

Tool Description
promote_memory Manually transition a memory between lifecycle states (stm, ltm, archived).
memory_feedback Provide agent feedback on a memory's usefulness. Positive feedback boosts activation energy and records an access; negative feedback suppresses it. Triggers state transitions when thresholds are crossed. Closes the agent reinforcement loop.
deep_recall Search across ALL lifecycle states. Auto-resurrects high-scoring archived entries above the resurrection threshold.
decay_cycle Trigger activation energy recomputation and state transitions for a namespace.
configure_decay Set per-namespace decay parameters (decayRate, reinforcementWeight, stmThreshold, archiveThreshold). Used by background service and decay_cycle with useStoredConfig=true.

Activation energy formula: (accessCount × reinforcementWeight) − (hoursSinceLastAccess × decayRate)
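
A sketch of the formula and the state transitions it drives (threshold and weight defaults below are placeholders; real values come from configure_decay):

```python
def activation_energy(access_count, hours_since_access,
                      reinforcement_weight=1.0, decay_rate=0.1):
    # (accessCount x reinforcementWeight) - (hoursSinceLastAccess x decayRate)
    return access_count * reinforcement_weight - hours_since_access * decay_rate

def next_state(state, energy, stm_threshold=2.0, archive_threshold=0.0):
    # Hypothetical thresholds illustrating the STM -> LTM -> archived transitions
    if state == "stm" and energy >= stm_threshold:
        return "ltm"
    if state == "ltm" and energy < archive_threshold:
        return "archived"
    return state
```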

Admin (2 tools)

Tool Description
get_memory Retrieve full cognitive context for an entry (lifecycle, edges, clusters). Does not count as an access.
cognitive_stats System overview: entry counts by state, cluster count, edge count, and namespace list.

Accretion (4 tools)

Tool Description
get_pending_collapses List dense LTM clusters detected by the background scanner that are awaiting LLM summarization.
collapse_cluster Execute a pending collapse: store a summary entry, archive the source members, and create a cluster.
dismiss_collapse Dismiss a detected collapse and exclude its members from future scans.
trigger_accretion_scan Manually run a DBSCAN density scan on LTM entries in a namespace.

collapse_cluster reliability behavior:

  • If collapse steps complete successfully, the pending collapse is removed and a reversal record is persisted to disk.
  • If summary storage or any member archival step fails, the tool returns an error and preserves the pending collapse so the same collapseId can be retried.
  • Collapse records survive server restarts and can be reversed with uncollapse_cluster.
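
trigger_accretion_scan runs DBSCAN over LTM entries. A generic DBSCAN for reference (the scanner's eps, minPts, and distance metric are its own):

```python
def dbscan(points, eps, min_pts, dist):
    # Classic DBSCAN: grow clusters from core points (>= min_pts neighbors within eps)
    labels = {}          # point index -> cluster id; -1 means noise
    cluster_id = 0
    def neighbors(i):
        return [j for j in range(len(points)) if dist(points[i], points[j]) <= eps]
    for i in range(len(points)):
        if i in labels:
            continue
        nbrs = neighbors(i)
        if len(nbrs) < min_pts:
            labels[i] = -1               # noise (may later become a border point)
            continue
        labels[i] = cluster_id
        frontier = [j for j in nbrs if j != i]
        while frontier:
            j = frontier.pop()
            if labels.get(j) == -1:
                labels[j] = cluster_id   # noise reclassified as border point
            if j in labels:
                continue
            labels[j] = cluster_id
            jn = neighbors(j)
            if len(jn) >= min_pts:       # j is also a core point: keep expanding
                frontier.extend(jn)
        cluster_id += 1
    return labels
```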

Intelligence & Safety (5 tools)

Tool Description
detect_duplicates Find near-duplicate entries in a namespace by pairwise cosine similarity above a configurable threshold.
find_contradictions Surface contradictions: entries linked with contradicts graph edges, plus high-similarity topic-relevant pairs for review.
merge_memories Merge two duplicate entries: keeps the first entry's vector, combines metadata and access counts, transfers graph edges and cluster memberships, and archives the second entry.
uncollapse_cluster Reverse a previously executed accretion collapse: restore archived members to pre-collapse state, delete summary, clean up cluster.
list_collapse_history List all reversible collapse records for a namespace.

Panel of Experts / Debate (3 tools)

Tool Description
consult_expert_panel Consult a panel of experts by running parallel searches across multiple expert namespaces. Stores each perspective in an active-debate namespace and returns integer-aliased results so the LLM can reference nodes without managing UUIDs. Replaces multiple search_memory + store_memory calls with a single macro-command.
map_debate_graph Map logical relationships between debate nodes using integer aliases from consult_expert_panel. Translates aliases to UUIDs internally and batch-creates knowledge graph edges. Replaces multiple link_memories calls with a single macro-command.
resolve_debate Resolve a debate by storing a consensus summary as LTM, linking it to the winning perspective, and batch-archiving all raw debate nodes. Cleans up session state. Replaces manual store_memory + link_memories + promote_memory calls with a single macro-command.

Debate workflow: consult_expert_panel (gather perspectives) → map_debate_graph (define relationships) → resolve_debate (store consensus). Sessions use integer aliases (1, 2, 3...) so the LLM never handles UUIDs. Sessions auto-expire after 1 hour.

Benchmarking & Observability (3 tools)

Tool Description
run_benchmark Run an IR quality benchmark. Datasets: default-v1 (25 seeds, 20 queries), paraphrase-v1 (25 seeds, 15 queries), multihop-v1 (25 seeds, 15 queries), scale-v1 (80 seeds, 30 queries), realworld-v1 (30 seeds, 20 queries — cognitive memory patterns). Computes Recall@K, Precision@K, MRR, nDCG@K, and latency percentiles.
get_metrics Get operational metrics: latency percentiles (P50/P95/P99), throughput, and counts for search, store, and other operations.
reset_metrics Reset collected operational metrics. Optionally filter by operation type.

Five benchmark datasets: four covering generic CS topics (programming languages, data structures, ML, databases, networking, systems, security, DevOps) and one real-world dataset modeled after actual cognitive memory entries (architecture decisions, bug fixes, code patterns, user preferences, lessons learned). Relevance grades use a 0–3 scale (3 = highly relevant).
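
The reported metrics follow standard IR definitions; a reference sketch using the 0–3 relevance scale:

```python
import math

def recall_at_k(ranked, relevant, k):
    # Fraction of relevant ids that appear in the top-k results
    hits = sum(1 for r in ranked[:k] if r in relevant)
    return hits / len(relevant) if relevant else 0.0

def mrr(ranked, relevant):
    # Reciprocal rank of the first relevant result (0 if none found)
    for i, r in enumerate(ranked, start=1):
        if r in relevant:
            return 1.0 / i
    return 0.0

def ndcg_at_k(ranked, grades, k):
    # grades: id -> relevance grade on the 0-3 scale
    dcg = sum(grades.get(r, 0) / math.log2(i + 1)
              for i, r in enumerate(ranked[:k], start=1))
    ideal = sorted(grades.values(), reverse=True)[:k]
    idcg = sum(g / math.log2(i + 1) for i, g in enumerate(ideal, start=1))
    return dcg / idcg if idcg else 0.0
```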

Maintenance (2 tools)

Tool Description
rebuild_embeddings Re-embed all entries in one or all namespaces using the current embedding model. Use after upgrading the embedding model to regenerate vectors from stored text. Entries without text are skipped. Preserves all metadata, lifecycle state, and timestamps.
compression_stats Show vector compression statistics for a namespace or all namespaces. Reports FP32 vs Int8 disk savings, quantization coverage, and memory footprint estimates.

Expert Routing (2 tools)

Tool Description
dispatch_task Route a query to the most relevant expert namespace via semantic similarity against the meta-index. Returns the expert profile and top memories from that namespace as context, or needs_expert status if no qualified expert is found.
create_expert Instantiate a new expert namespace and register it in the semantic routing meta-index. The persona description is embedded for future query routing.

Expert routing workflow: dispatch_task (route query) → if miss: create_expert (define specialist) → dispatch_task (retry). The system maintains a hidden _system_experts meta-index that maps queries to specialized namespaces via cosine similarity (default threshold: 0.75). Experts within a 5% score margin of the top match are returned as candidates.
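
The routing rule described above (0.75 similarity threshold, 5% candidate margin) can be sketched as:

```python
import math

def cos(a, b):
    # Plain cosine similarity
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def dispatch(query_vec, experts, threshold=0.75, margin=0.05):
    # experts: {name: persona_embedding}; mirrors the _system_experts meta-index
    scored = sorted(((name, cos(query_vec, vec)) for name, vec in experts.items()),
                    key=lambda t: t[1], reverse=True)
    if not scored or scored[0][1] < threshold:
        return "needs_expert", []
    top = scored[0][1]
    # Experts within a 5% score margin of the best match are returned as candidates
    return "matched", [name for name, s in scored if s >= top * (1 - margin)]
```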

Architecture

Services

CognitiveIndex is a thin facade managing CRUD, locking, and memory limits. Search, hybrid search, and duplicate detection are delegated to stateless engines that operate on data snapshots.

Service Namespace Description
CognitiveIndex Services Thread-safe facade: CRUD, lifecycle state, access tracking, memory limits enforcement. Delegates search to engines below
NamespaceStore Services Namespace-partitioned storage with lazy loading from disk and BM25 indexing
VectorSearchEngine Retrieval Stateless k-NN search with HNSW ANN candidate generation (≥200 entries) or two-stage Int8 screening (≥30 entries) → FP32 exact reranking
HnswIndex Retrieval Hierarchical Navigable Small World graph for O(log N) approximate nearest neighbor search with soft deletion and compacting rebuild
HybridSearchEngine Retrieval Stateless BM25 + vector fusion via Reciprocal Rank Fusion (RRF)
BM25Index Retrieval In-memory keyword search index with TF-IDF scoring
QueryExpander Retrieval IDF-based query term expansion for improved recall
TokenReranker Retrieval Token-overlap reranker implementing IReranker
VectorMath Retrieval SIMD-accelerated dot product and norm (static utility)
VectorQuantizer Retrieval Int8 scalar quantization: Quantize, Dequantize, SIMD Int8DotProduct, ApproximateCosine
DuplicateDetector Intelligence Stateless pairwise cosine similarity duplicate detection (O(N) single-entry, O(N²) namespace-wide)
KnowledgeGraph Graph In-memory directed graph with adjacency lists, bidirectional edge support, edge transfer, and contradiction surfacing
ClusterManager Intelligence Semantic cluster CRUD with automatic centroid computation and membership transfer
AccretionScanner Intelligence DBSCAN-based density scanning with reversible collapse history (persisted to disk)
LifecycleEngine Lifecycle Activation energy computation, agent feedback reinforcement, per-namespace decay configs, decay cycles, and state transitions (STM/LTM/archived)
PhysicsEngine Services Gravitational force re-ranking with "Asteroid" (semantic) + "Sun" (importance) output
BenchmarkRunner Evaluation IR quality benchmark execution with Recall@K, Precision@K, MRR, nDCG@K scoring
MetricsCollector Evaluation Thread-safe operational metrics with P50/P95/P99 latency percentiles
DebateSessionManager Experts Volatile in-memory session state for debate workflows with integer alias mapping and 1-hour TTL auto-purge
ExpertDispatcher Experts Semantic routing engine that maps queries to specialized expert namespaces via a hidden meta-index
PersistenceManager Storage JSON file-based IStorageProvider with debounced async writes, SHA-256 checksums, and crash recovery
SqliteStorageProvider Storage SQLite-based IStorageProvider with WAL mode, schema migration framework, and incremental per-entry writes
OnnxEmbeddingService Services 384-dimensional vector embeddings via bge-micro-v2 ONNX model with FastBertTokenizer
HashEmbeddingService Services Deterministic hash-based embeddings for testing/CI (no model dependency)
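
The Reciprocal Rank Fusion used by HybridSearchEngine follows the standard formula score(d) = Σ 1/(k + rank); a sketch with the conventional k = 60 (the engine's constant may differ):

```python
def rrf_fuse(rankings, k=60):
    # rankings: list of ranked id lists (e.g., BM25 order and vector order)
    # Each list contributes 1 / (k + rank) per document, rank starting at 1
    scores = {}
    for ranked in rankings:
        for rank, doc_id in enumerate(ranked, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```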

Background Services

Service Interval Description
EmbeddingWarmupService Startup Warms up the embedding model on server start so first queries are fast
DecayBackgroundService 15 minutes Runs activation energy decay on all namespaces using stored per-namespace configs
AccretionBackgroundService 30 minutes Scans all namespaces for dense LTM clusters needing summarization

Models

Model Description
CognitiveEntry Core memory entry with vector, text, metadata, lifecycle state, and activation energy
QuantizedVector Int8 quantized vector with sbyte[] data, min/scale for reconstruction, and precomputed self-dot product
FloatArrayBase64Converter JSON converter for float[] — writes Base64 strings, reads both Base64 and legacy JSON arrays for backwards compatibility

Searchable Compression

Vectors use a lifecycle-driven compression pipeline:

  • STM entries: Full FP32 precision for maximum search accuracy
  • LTM/archived entries: Auto-quantized to Int8 (asymmetric min/max → [-128, 127]) on state transition
  • HNSW index: Namespaces with 200+ entries auto-build an HNSW graph for O(log N) approximate nearest neighbor candidate generation
  • Two-stage search: Namespaces with 30–199 entries use Int8 screening (top k×5 candidates) followed by FP32 exact cosine reranking
  • SIMD acceleration: Int8DotProduct uses System.Numerics.Vector<T> for portable hardware-accelerated dot products (sbyte→short→int widening pipeline)
  • Base64 persistence: Vectors are serialized as Base64 strings instead of JSON number arrays, reducing disk usage by ~60%. Legacy JSON arrays are still readable for backwards compatibility.
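
The asymmetric Int8 quantization step can be sketched as follows (simplified; the real VectorQuantizer also precomputes self-dot products for fast approximate cosine):

```python
def quantize(vec):
    # Asymmetric min/max scalar quantization of FP32 values into [-128, 127]
    lo, hi = min(vec), max(vec)
    scale = (hi - lo) / 255.0 or 1.0   # guard against constant vectors
    data = [round((x - lo) / scale) - 128 for x in vec]
    return data, lo, scale

def dequantize(data, lo, scale):
    # Approximate FP32 reconstruction; error is bounded by the quantization step
    return [(q + 128) * scale + lo for q in data]
```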

Persistence

Two storage backends are available, selectable via environment variable:

JSON file backend (default):

  • Data stored in a data/ directory as JSON files
  • {namespace}.json — entries with Base64-encoded vectors (per namespace)
  • _edges.json — graph edges (global)
  • _clusters.json — semantic clusters (global)
  • _collapse_history.json — reversible collapse records (global)
  • _decay_configs.json — per-namespace decay configurations (global)
  • Writes are debounced (500ms default) with SHA-256 checksums for crash recovery

SQLite backend (MEMORY_STORAGE=sqlite):

  • Single memory.db file with WAL mode for concurrent read/write
  • Tables: entries, edges, clusters, collapse_history, decay_configs, schema_version
  • Automatic schema migrations (v1→v2 adds lifecycle_state column with backfill)
  • Suitable for higher-throughput or multi-process scenarios

Environment Variables

Variable Default Description
MEMORY_TOOL_PROFILE full Tool profile: minimal (5 tools), standard (18 tools), full (38 tools)
MEMORY_STORAGE json Storage backend: json or sqlite
MEMORY_SQLITE_PATH data/memory.db SQLite database file path (only when MEMORY_STORAGE=sqlite)
MEMORY_MAX_NAMESPACE_SIZE unlimited Maximum entries per namespace
MEMORY_MAX_TOTAL_COUNT unlimited Maximum total entries across all namespaces

Usage

MCP Server

Configure the MCP server in your client (e.g. Claude Desktop, VS Code):

{
  "mcpServers": {
    "engram-memory": {
      "command": "dotnet",
      "args": ["run", "--project", "/path/to/mcp-engram-memory/src/McpEngramMemory"],
      "env": {
        "MEMORY_STORAGE": "sqlite",
        "MEMORY_MAX_NAMESPACE_SIZE": "10000"
      }
    }
  }
}

The env block is optional. Omit it to use the JSON file backend with no memory limits.

AI Assistant Setup

Each setup below is a single prompt you paste into the respective tool. The AI will read this README, create all config files, and write the custom instructions for you.

Claude Code Setup

Open Claude Code in your project directory and paste:

Set up mcp-engram-memory as my persistent memory system. Do the following:

1. Add the MCP server to my Claude Code config. The server runs via:
   command: dotnet
   args: run --project /path/to/mcp-engram-memory/src/McpEngramMemory

2. Create or update my CLAUDE.md (global at ~/.claude/CLAUDE.md) with these sections:

   ## Recall: Search Before You Work
   - At conversation start, search vector memory using up to 3 parallel agents:
     Agent 1: search_memory in the project namespace for the current task
       (use hybrid: true and expandGraph: true for richer recall)
     Agent 2: search_memory in "work" and "synthesis" namespaces for cross-project patterns
     Agent 3: search_memory with alternative phrasings/keywords in the project namespace
       (use hybrid: true for keyword+vector fusion)
   - For graph-connected knowledge, use expandGraph: true to pull in linked memories
   - Tool selection: search_memory for project context, dispatch_task for cross-domain
     questions, consult_expert_panel for multi-perspective analysis, deep_recall for
     archived knowledge, detect_duplicates and find_contradictions for quality checks

   ## Store: Save What You Learn
   - Store memories after completing tasks, fixing bugs, learning patterns, or receiving
     corrections. Use the project directory name as namespace, kebab-case IDs, include
     domain keywords in text for searchability, and categorize as one of: decision, pattern,
     bug-fix, architecture, preference, lesson, reference
   - Pre-store quality checks: verify text is self-contained, includes domain keywords,
     doesn't duplicate existing memories, and has the correct category
   - When store_memory warns about duplicates: skip if existing is accurate, upsert same
     ID if outdated, or store and link if both are distinct

   ## Expert Routing
   - Use dispatch_task for open-ended questions. If it returns needs_expert,
     call create_expert with a detailed persona, then seed that expert's namespace
   - Lifecycle: promote STM to LTM when recalled 2+ times, documents a stable pattern,
     captures a recurring bug fix, or records a user correction
   - Link related memories with link_memories using: parent_child, cross_reference,
     similar_to, contradicts, elaborates, depends_on

   ## Session Retrospective
   - At the end of significant sessions, self-evaluate: what went well, what went wrong,
     what you'd do differently, key decisions made
   - Store retrospective with: id "retro-YYYY-MM-DD-topic", category "lesson",
     specific actionable lessons (not vague observations)
   - Link retrospectives to related bug fixes, patterns, or decisions
   - Search past retrospectives before starting similar work

Confirm each file you create and show me the final contents.

GitHub Copilot Setup

Open VS Code with Copilot and paste in chat:

Set up mcp-engram-memory as my persistent memory system. Do the following:

1. Create .vscode/mcp.json with a stdio server entry:
   name: engram-memory
   command: dotnet
   args: ["run", "--project", "/path/to/mcp-engram-memory/src/McpEngramMemory"]

2. Create .github/copilot-instructions.md with vector memory instructions:

   ## Recall
   - Before starting any task, use search_memory with the project namespace to recall
     relevant context. Use hybrid: true for keyword+vector fusion and expandGraph: true
     to pull in linked memories. Search for past decisions and bugs before answering.
   - Tool selection: search_memory for project context, dispatch_task for cross-domain
     questions (auto-routes to best expert namespace), consult_expert_panel for multiple
     perspectives, deep_recall for archived/forgotten knowledge, detect_duplicates and
     find_contradictions for memory quality.

   ## Store
   - Store memories after completing tasks, fixing bugs, learning patterns, or receiving
     corrections. Use project directory name as namespace, kebab-case IDs, write text
     with domain keywords for future searchability, categorize as: decision, pattern,
     bug-fix, architecture, preference, lesson, reference.
   - Pre-store quality checks: verify text is self-contained, includes domain keywords,
     doesn't duplicate existing memories, and has the correct category.
   - On duplicate warnings: skip if existing is accurate, upsert if outdated, store and
     link if both are distinct.

   ## Expert Routing
   - dispatch_task routes to experts automatically. If needs_expert is returned, use
     create_expert with a detailed persona description, then populate the expert namespace.
   - Lifecycle: promote STM to LTM when stable and reused across sessions.
   - Link related memories using: parent_child, cross_reference, similar_to, contradicts,
     elaborates, depends_on.

   ## Session Retrospective
   - At the end of significant sessions, store a self-evaluation: what went well, what
     went wrong, lessons learned. Use id "retro-YYYY-MM-DD-topic", category "lesson".
   - Search past retrospectives before starting similar work to avoid repeating mistakes.

Confirm each file you create and show me the final contents.

Google Gemini CLI Setup

Open Gemini CLI and paste in chat:

Set up mcp-engram-memory as my persistent memory system. Do the following:

1. Add the MCP server to my Gemini CLI config (edit ~/.gemini/settings.json):
   name: engram-memory
   command: dotnet
   args: ["run", "--project", "/path/to/mcp-engram-memory/src/McpEngramMemory"]

2. Create GEMINI.md in my workspace root with vector memory instructions:

   ## Recall
   - Before starting any task, use search_memory with the project namespace to recall
     relevant context. Use hybrid: true for keyword+vector fusion and expandGraph: true
     to pull in linked memories. Search for past decisions and bugs before answering.
   - Tool selection: search_memory for project context, dispatch_task for cross-domain
     questions (auto-routes to best expert namespace), consult_expert_panel for multiple
     perspectives, deep_recall for archived/forgotten knowledge, detect_duplicates and
     find_contradictions for memory quality.

   ## Store
   - Store memories after completing tasks, fixing bugs, learning patterns, or receiving
     corrections. Use project directory name as namespace, kebab-case IDs, write text
     with domain keywords for future searchability, categorize as: decision, pattern,
     bug-fix, architecture, preference, lesson, reference.
   - Pre-store quality checks: verify text is self-contained, includes domain keywords,
     doesn't duplicate existing memories, and has the correct category.
   - On duplicate warnings: skip if existing is accurate, upsert if outdated, store and
     link if both are distinct.

   ## Expert Routing
   - dispatch_task routes to experts automatically. If needs_expert is returned, use
     create_expert with a detailed persona description, then populate the expert namespace.
   - Lifecycle: promote STM to LTM when stable and reused across sessions.
   - Link related memories using: parent_child, cross_reference, similar_to, contradicts,
     elaborates, depends_on.

   ## Session Retrospective
   - At the end of significant sessions, store a self-evaluation: what went well, what
     went wrong, lessons learned. Use id "retro-YYYY-MM-DD-topic", category "lesson".
   - Search past retrospectives before starting similar work to avoid repeating mistakes.

Confirm each file you create and show me the final contents.

OpenAI Codex Setup

Open Codex CLI in your project directory and paste:

Set up mcp-engram-memory as my persistent memory system. Do the following:

1. Add the MCP server to my Codex config. Either run:
   codex mcp add engram-memory -- dotnet run --project /path/to/mcp-engram-memory/src/McpEngramMemory
   Or add this to ~/.codex/config.toml (or .codex/config.toml for project-scoped):
   [mcp_servers.engram-memory]
   command = "dotnet"
   args = ["run", "--project", "/path/to/mcp-engram-memory/src/McpEngramMemory"]

2. Create AGENTS.md in the project root with vector memory instructions:

   ## Recall
   - Before starting any task, use search_memory with the project namespace to recall
     relevant context. Use hybrid: true for keyword+vector fusion and expandGraph: true
     to pull in linked memories. Search for past decisions and bugs before answering.
   - Tool selection: search_memory for project context, dispatch_task for cross-domain
     questions (auto-routes to best expert namespace), consult_expert_panel for multiple
     perspectives, deep_recall for archived/forgotten knowledge, detect_duplicates and
     find_contradictions for memory quality.

   ## Store
   - Store memories after completing tasks, fixing bugs, learning patterns, or receiving
     corrections. Use project directory name as namespace, kebab-case IDs, write text
     with domain keywords for future searchability, categorize as: decision, pattern,
     bug-fix, architecture, preference, lesson, reference.
   - Pre-store quality checks: verify text is self-contained, includes domain keywords,
     doesn't duplicate existing memories, and has the correct category.
   - On duplicate warnings: skip if existing is accurate, upsert if outdated, store and
     link if both are distinct.

   ## Expert Routing
   - dispatch_task routes to experts automatically. If needs_expert is returned, use
     create_expert with a detailed persona description, then populate the expert namespace.
   - Lifecycle: promote STM to LTM when stable and reused across sessions.
   - Link related memories using: parent_child, cross_reference, similar_to, contradicts,
     elaborates, depends_on.

   ## Session Retrospective
   - At the end of significant sessions, store a self-evaluation: what went well, what
     went wrong, lessons learned. Use id "retro-YYYY-MM-DD-topic", category "lesson".
   - Search past retrospectives before starting similar work to avoid repeating mistakes.

Confirm each file you create and show me the final contents.
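The retrospective ID convention used in both prompts ("retro-YYYY-MM-DD-topic", with kebab-case IDs) can be illustrated with a small helper. The function name is hypothetical and not part of this package:

```python
from datetime import date
import re

def retro_id(topic: str) -> str:
    """Build a retrospective memory id of the form retro-YYYY-MM-DD-topic."""
    # Kebab-case the topic: lowercase, collapse non-alphanumerics to hyphens
    slug = re.sub(r"[^a-z0-9]+", "-", topic.lower()).strip("-")
    return f"retro-{date.today().isoformat()}-{slug}"

retro_id("Fix Cache Bug")  # e.g. "retro-2026-03-31-fix-cache-bug"
```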

Build & Test

cd mcp-engram-memory
dotnet build
dotnet test

Tests

30 test files with 454 test cases (per the table below) covering:

| Test File | Tests | Focus |
| --- | --- | --- |
| CognitiveIndexTests.cs | 43 | Vector search, lifecycle filtering, persistence, memory limits |
| BenchmarkRunnerTests.cs | 46 | IR metrics (Recall@K, Precision@K, MRR, nDCG@K), 5 benchmark datasets, ONNX benchmarks, ablation study |
| IntelligenceTests.cs | 39 | Duplicate detection, contradictions, reversible collapse, decay tuning, hash embeddings, merge memories |
| KnowledgeGraphTests.cs | 20 | Edge operations, graph traversal, batch edge creation, edge transfer |
| CoreMemoryToolsTests.cs | 20 | Store, search, delete memory tool endpoints |
| PhysicsEngineTests.cs | 19 | Mass computation, gravitational force, slingshot |
| AccretionScannerTests.cs | 18 | DBSCAN clustering, pending collapses |
| DebateToolsTests.cs | 17 | Debate tools: validation, cold-start, expert retrieval, edge creation, resolve lifecycle, full E2E pipeline |
| ClusterManagerTests.cs | 16 | Cluster CRUD, centroid operations, membership transfer |
| SqliteStorageProviderTests.cs | 15 | SQLite backend: CRUD, persistence, concurrent access, WAL mode |
| ExpertToolsTests.cs | 15 | dispatch_task/create_expert tools: validation, routing pipeline, context retrieval, full E2E workflows |
| ExpertDispatcherTests.cs | 15 | Expert creation, routing hits/misses, threshold handling, access tracking, meta-index management |
| DebateSessionManagerTests.cs | 14 | Session management: alias registration, resolution, TTL purge, namespace generation |
| VectorQuantizerTests.cs | 13 | Int8 quantization, dequantization roundtrip, SIMD dot product, cosine preservation, edge cases |
| LifecycleEngineTests.cs | 12 | State transitions, deep recall, decay cycles |
| QueryExpanderTests.cs | 9 | IDF-based query expansion, term weighting |
| RegressionTests.cs | 9 | Integration and edge-case scenarios |
| PersistenceManagerTests.cs | 9 | JSON serialization, debounced saves, checksums |
| FloatArrayBase64ConverterTests.cs | 9 | Base64 serialization roundtrip, legacy JSON array reading |
| QuantizedSearchTests.cs | 8 | Two-stage search pipeline, lifecycle-driven quantization, mixed-state ranking |
| MetricsCollectorTests.cs | 8 | Latency recording, percentile computation, timer pattern |
| MaintenanceToolsTests.cs | 7 | Rebuild embeddings, compression stats, vector update, metadata preservation |
| ChecksumTests.cs | 7 | SHA-256 persistence checksums, crash recovery |
| AccretionToolsTests.cs | 7 | Accretion tool functionality |
| DecayBackgroundServiceTests.cs | 2 | Background service decay cycles |
| AccretionBackgroundServiceTests.cs | 2 | Background service lifecycle |
| HnswIndexTests.cs | 13 | HNSW index: add/search/remove, high-dimensional recall, rebuild, edge cases |
| FeedbackTests.cs | 13 | Agent feedback: energy boost/suppress, state transitions, access tracking, clamping, cumulative |
| InvariantTests.cs | 27 | Structural invariants across JSON and SQLite backends |
| EmbeddingWarmupServiceTests.cs | 2 | Embedding warmup startup behavior |
Compatible Target Frameworks

.NET net8.0 is compatible. net9.0 and net10.0 were computed as compatible, as were the platform-specific variants (-android, -browser, -ios, -maccatalyst, -macos, -tvos, -windows) of net8.0, net9.0, and net10.0.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

| Version | Downloads | Last Updated |
| --- | --- | --- |
| 0.6.1 | 97 | 3/31/2026 |
| 0.6.0 | 94 | 3/31/2026 |
| 0.5.5 | 91 | 3/25/2026 |
| 0.5.4 | 83 | 3/22/2026 |
| 0.5.2 | 78 | 3/22/2026 |
| 0.5.1 | 75 | 3/21/2026 |
| 0.5.0 | 82 | 3/21/2026 |
| 0.4.1 | 115 | 3/10/2026 |
| 0.4.0 | 92 | 3/10/2026 |