Back to MCP Servers

Recallnest

Persistent memory MCP server for AI coding agents (Claude Code, Codex, Gemini CLI). Hybrid retrieval (vector + BM25), cross-encoder reranking, knowledge graph with PPR traversal, session checkpoint/resume, and multi-scope isolation. Local-first with LanceDB + SQLite, zero extern…

knowledge-memorysqliteaiagent
By AliceLJY
142Updated todayTypeScriptMIT

Installation

npx -y recallnest

Configuration

{
  "mcpServers": {
    "recallnest": {
      "command": "npx",
      "args": ["-y", "recallnest"]
    }
  }
}

How to use

  1. Run the installation command above (if needed)
  2. Open your Claude Code settings file (~/.claude/settings.json)
  3. Add the configuration to the mcpServers section
  4. Restart Claude Code to apply changes
<div align="center">

RecallNest

Shared Memory Layer for Claude Code, Codex, and Gemini CLI

One memory. Three terminals. Context that survives across windows.

A local-first memory system backed by LanceDB that turns scattered conversation history into reusable knowledge — shared across your coding agents, recalled automatically.

GitHub License: MIT Runtime LanceDB MCP Tests CC Plugin

English | 简体中文 | Roadmap

</div>

Why RecallNest?

Coding agents forget everything between windows. Your context — project configs, debugging decisions, entity mappings — is scattered across Claude Code, Codex, and Gemini CLI with no shared memory.

RecallNest solves this: a single LanceDB-backed memory layer that your coding agents read and write. Context stored in one window is auto-recalled in another. Sessions checkpoint on exit and resume on start. Memory decays, evolves, and self-organizes — not just raw log storage.

Quick Start

Option A: Claude Code Plugin (recommended)

/plugin marketplace add AliceLJY/recallnest
/plugin install recallnest@AliceLJY

RecallNest starts automatically with Claude Code. No manual MCP config needed.

Requires: Bun (recommended) or Node.js 18+. Dependencies install on first start.

Option B: npm install

npx recallnest --help          # run directly
# or
npm install -g recallnest      # install globally
recallnest doctor

Works with Node.js 18+ (via tsx) or Bun. No git clone needed.

Option C: Manual setup

git clone https://github.com/AliceLJY/recallnest.git
cd recallnest
bun install
cp config.json.example config.json
cp .env.example .env
# Edit .env → add your JINA_API_KEY

Start the server

bun run api
# → RecallNest API running at http://localhost:4318

Try it

# Store a memory
curl -X POST http://localhost:4318/v1/store \
  -H "Content-Type: application/json" \
  -d '{"text": "User prefers dark mode", "category": "preferences"}'

# Recall memories
curl -X POST http://localhost:4318/v1/recall \
  -H "Content-Type: application/json" \
  -d '{"query": "user preferences"}'

# Check stats
curl http://localhost:4318/v1/stats

Connect your terminals

bash integrations/claude-code/setup.sh
bash integrations/gemini-cli/setup.sh
bash integrations/codex/setup.sh

Each script installs MCP access and managed continuity rules, so resume_context fires automatically in fresh windows.

Index existing conversations

bun run src/cli.ts ingest --source all
bun run seed:continuity
bun run src/cli.ts doctor

Web UI

<p align="center"> <img src="assets/dashboard.png" alt="RecallNest Dashboard" width="800" /> <br><em>Dashboard — total count, category distribution, health score, and growth trends at a glance.</em> </p> <p align="center"> <img src="assets/screenshots/ui-full.png" alt="RecallNest Search Workbench" width="800" /> <br><em>Search Workbench — hybrid search with topic tag filtering, 4 retrieval profiles, Skills browser, and asset management.</em> </p> <p align="center"> <img src="assets/knowledge-graph.png" alt="RecallNest Knowledge Graph" width="800" /> <br><em>Knowledge Graph — interactive force-directed visualization with semantic bridges revealing cross-domain connections.</em> </p>
bun run src/ui-server.ts
# → http://localhost:4317

Core Capabilities

Access & Setup

CapabilityDescription
CC PluginInstall in Claude Code with one command — no manual config
Shared IndexOne LanceDB store for Claude Code, Codex, and Gemini CLI
Dual InterfaceMCP (stdio) for CLI tools + HTTP API for custom agents
One-Click SetupIntegration scripts install MCP access and continuity rules

Recall & Continuity

CapabilityDescription
Hybrid Retrieval6-channel: vector + BM25 + L0/L1/L2 multi-vector + KG graph (PPR)
4 Retrieval Profilesdefault, writing, debug, fact-check — tuned for different tasks
Session Continuitycheckpoint_session + resume_context (full/light/summary modes) with repo-state guard
Session Distiller3-layer conversation compression: microcompact → LLM summary → knowledge extraction
Conversation ImportImport from Claude Code, Claude.ai, ChatGPT, Slack, and plaintext
Topic TagsIntra-scope topic partitioning — auto-detected, filterable in search

Memory Lifecycle & Governance

CapabilityDescription
Memory EvolutionSupersede chains, decay scoring, LLM importance, consolidation, archival
Smart PromotionEvidence → durable memory with conflict guards, merge resolution, and audit trail
Privacy Tiers4-tier (ephemeral / private / durable / shared) with cascade forgetting
Admission ControlWrite-time gating: noise filter, importance floor, dedup, rate limiting
Memory LintContradiction, duplicate, stale, and orphan detection with health score
Offline Consolidationdream command: clustering, merging, pruning of accumulated memories

Reasoning & Structure

CapabilityDescription
Knowledge GraphEntity relation graph with PPR algorithm for multi-hop questions
Constructive RetrievalMulti-source candidate expansion + grounded context reconstruction
Narrative Architecture3-layer autobiographical metadata (life-period → general-event → specific-event)
Skill MemoryStore, retrieve, and promote executable skills from recurring patterns
Predictive RemindersBehavioral-signal prediction engine surfaces "you might need this" suggestions
6 Categoriesprofile, preferences, entities, events, cases, patterns — with category-aware merge strategies

Visibility & Operations

CapabilityDescription
DashboardWeb UI with stats, category distribution, growth trends, and health
Workflow ObservationDedicated append-only workflow health records, outside regular memory
Structured AssetsPins, briefs, and distilled summaries — not just raw logs
Data CheckupData quality health checks on the memory store (including source health)
Source HeartbeatsAutomatic ingest health tracking per data source with staleness alerts
Export GraphExport interactive HTML knowledge graph visualization
Batch OperationsStore up to 20 memories in a single call with dedup
Connector FrameworkStandard connector-v1 format for external data sources with example adapters

New in v2.1: Philosophy-Informed Memory

v2.0 built the operational memory platform; v2.1 added philosophy-informed memory behavior.

Five upgrades derived from 9 research dimensions in philosophy of memory, each mapped to concrete engineering:

  • Emotion-Aware Decay (Affective Memory Theory) — Memories with strong emotional content decay 20-30% slower. Keyword-based emotion detection computes salience (mnemonic significance), which feeds into the Weibull half-life formula and a rebalanced 4-factor evolution score. Zero LLM cost.

  • Memory Ethics Layer (Right to Be Forgotten / GDPR Art. 17) — Four privacy tiers (ephemeral / private / durable / shared). Cascade forgetting engine that propagates deletion through KG triples, evolution chains, pin assets, and briefs. Full audit trail. forget_memory MCP tool for agent-driven deletion.

  • Autobiographical Narrative (Narrative Identity Theory / Conway's 3-layer model) — Memories are tagged with lifePeriod → generalEvent → specificEvent hierarchy, orthogonal to existing 6 categories. Retrieval pulls narrative siblings. Context rendering groups by life period. Rule-based tagger with EN+CN support.

  • Constructive Retrieval (Simulation Theory / Michaelian) — Instead of returning raw stored text, RecallNest now reconstructs context from an expanded candidate set: KG neighbors + evolution chains + cluster members + narrative siblings. Source-map grounded coverage replaces lexical overlap. Contradictions are detected and flagged.

  • Predictive Prospective Memory (Mental Time Travel / Tulving) — Heuristic prediction engine that surfaces "you might need this" reminders from behavioral signals: stale checkpoint open loops, corrected workflow observations, high-frequency dormant memories, and uncovered query topics. Zero LLM cost. Auto-expire in 7 days if unaccepted.


New in v2.2: Retrieval Quality Hardening

v2.1 added philosophy-informed behavior; v2.2 closes the last three engine-layer gaps identified by a frontier research scan (ACC, PI-LLM, TSM).

  • Memory Confidence Meta-tags (ACC / Dual-Process UQ) — Each memory now carries structured ConfidenceMetadata (score, reliability tier: direct / inferred / hearsay). Auto-assigned from source on write (manual = 0.9, agent = 0.7, conversation_import = 0.5). Retrieval scores are weighted by confidence. resume_context tags low-confidence items with [低置信].

  • Interference Detection + Active Forgetting Gate (PI-LLM / SleepGate) — Semantic cluster detection identifies groups of near-duplicate memories competing for retrieval. Enhanced RIF keeps only top-K (default 3) per cluster; extras are demoted 50% instead of removed. Write-time pre-warning: when a scope accumulates ≥5 high-similarity active memories, the weakest is flagged pending_review. data_checkup reports interference density.

  • Temporal Validity Windows (TSM / TiMem / Zep)store_memory accepts validUntil (expiration) and eventTime (when the event actually happened). search_memory supports validAt (point-in-time query) and includeExpired (demote 80% instead of hide). Auto-GC applies 2× decay acceleration to expired memories.


New in v2.3: Connector Ecosystem + Source Health

v2.2 hardened retrieval quality; v2.3 opens RecallNest to external data sources with a standard connector framework and operational health monitoring.

  • Connector-v1 Standard (GB-2) — A JSON format (ConnectorOutputV1) that any external script can produce. Obsidian vaults, emails, RSS feeds, log files — normalize once, ingest through the full dedup/embed/extract pipeline. See docs/connector-spec.md for the specification and connectors/examples/ for adapter skeletons (email, logs, RSS).

  • Obsidian Vault Ingestion (GB-1) — First-party Obsidian connector: scans .md files, extracts frontmatter + wikilinks, maps folder structure to tags. One command: lm ingest --obsidian /path/to/vault.

  • Source Health Monitoring (GB-3) — Every connector ingest writes a heartbeat to data/source-heartbeat.json. data_checkup flags stale sources (>7d warning, >30d error). doctor --ci shows a per-source heartbeat summary with human-readable age.


Architecture

┌──────────────────────────────────────────────────────────┐
│                     Client Layer                          │
├──────────┬──────────┬──────────┬──────────────────────────┤
│ Claude   │ Gemini   │ Codex    │ Custom Agents / curl     │
│ Code     │ CLI      │          │                          │
└────┬

…
View source on GitHub