
MemPalace (High-Fidelity Hybrid Fork)

Specialized fork featuring Hybrid Lexical-Semantic Retrieval (RRF) for technical accuracy.

Registry Creator

@perseus

Registered On

Apr 9, 2026

Source Type

GitHub

Ways to contribute

Join the project team and help achieve these goals.

General Collaboration

Interested in a specific mission or want to join the project team for general help? Use the request form to introduce yourself.

Project Summary

This is the PerseusXR high-fidelity distribution of MemPalace. It introduces a structural upgrade to the retrieval layer by fusing ChromaDB (semantic) and SQLite FTS5 (lexical) using Reciprocal Rank Fusion (RRF). Developed to solve 'The Vector Blur' in technical environments, this version provides perfect recall for exact identifiers, Git hashes, and code symbols while maintaining conceptual search capabilities.

Project Documentation

Fetched from repository README

MemPalace — PerseusXR High-Fidelity Distribution

Hybrid Retrieval · Memory Trust Layer · Auto-Save · Live Sync

This is a specialized distribution of MemPalace maintained by PerseusXR. It preserves the verbatim-first philosophy of the original while adding four production-grade layers: hybrid lexical-semantic retrieval, a human-like memory trust lifecycle, AI-independent auto-save hooks, and an automated palace sync system.

What We Added · Quick Start · MCP Tools · Auto-Save Hooks · Palace Sync · Benchmarks · Architecture · Changelog


What PerseusXR Added

The upstream MemPalace is excellent foundational work. This fork addresses four production gaps:

1. Hybrid Lexical-Semantic Retrieval (hybrid_searcher.py)

Vector search alone has a "Vector Blur" problem: exact technical identifiers (git hashes, function signatures, hex addresses) carry low semantic weight and get outranked by thematically related but wrong results.

We added a SQLite FTS5 lexical mirror alongside ChromaDB, and fuse both result sets using Reciprocal Rank Fusion (RRF). Benchmarked result:

| Metric | Vector Only | Hybrid RRF | Improvement |
| --- | --- | --- | --- |
| Mean Reciprocal Rank (MRR) | 0.5395 | 0.8833 | +63.7% |
| Hit@1 Accuracy | 46.7% | 80.0% | +33.3% |

Benchmarked on a 4,344-drawer production palace against a 15-target gold standard. Reproduce: python eval/benchmark.py
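The fusion step is small enough to sketch in full. A minimal RRF implementation (the constant k=60 is the conventional default from the RRF literature, not necessarily what hybrid_searcher.py uses):

```python
def rrf_fuse(vector_ranked, lexical_ranked, k=60):
    """Reciprocal Rank Fusion: score(d) = sum over lists of 1 / (k + rank_d)."""
    scores = {}
    for ranking in (vector_ranked, lexical_ranked):
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first
    return sorted(scores, key=scores.get, reverse=True)
```

An exact-identifier match that ranks first lexically but is missing from the vector list still surfaces near the top of the fused ranking, which is what defeats the Vector Blur.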

2. Memory Trust Layer (drawer_trust.py + contradiction_detector.py)

Human memory has a lifecycle — beliefs get superseded, contradicted, verified. Without this, an AI memory system accumulates conflicting facts indefinitely.

Every drawer now has a trust record:

current → superseded   (newer fact wins — old one is kept but excluded from search)
current → contested    (conflict detected — surfaces with ⚠ warning in search)
contested → resolved   (AI or user picks the winner)
any → historical       (drawer deleted — ghost record remains for audit)
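The lifecycle above amounts to a small state machine. A hedged sketch (the function name and ALLOWED map are illustrative, not the drawer_trust.py API):

```python
# Illustrative transition map; "historical" is reachable from any state (drawer deleted).
ALLOWED = {
    "current": {"superseded", "contested"},
    "contested": {"resolved"},
}

def transition(status: str, new_status: str) -> str:
    """Validate a trust-status change against the lifecycle rules."""
    if new_status != "historical" and new_status not in ALLOWED.get(status, set()):
        raise ValueError(f"illegal transition: {status} -> {new_status}")
    return new_status
```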

Contradiction detection runs in the background when a new drawer is saved:

  • Stage 1: Fast LLM judge — compares new drawer against top-k similar existing drawers. Auto-resolves if confidence ≥ 0.8.
  • Stage 2: For ambiguous cases — pulls additional palace context, second LLM pass to resolve.

Save speed: unchanged (detection runs asynchronously on daemon threads). Fetch speed: improved (superseded memories are excluded by default, and scores are confidence-weighted).
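A hedged sketch of that fetch-time filtering (the multiplicative weighting is an assumption; the fork's exact formula lives in hybrid_searcher.py):

```python
def trust_filter(results, trust):
    """results: [(drawer_id, score)]; trust: {drawer_id: (status, confidence)}.
    Drops superseded drawers, weights scores by confidence, flags contested ones."""
    out = []
    for drawer_id, score in results:
        status, confidence = trust.get(drawer_id, ("current", 1.0))
        if status in ("superseded", "historical"):
            continue  # excluded from search by default
        out.append((drawer_id, score * confidence, status == "contested"))
    return sorted(out, key=lambda row: row[1], reverse=True)
```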

Works with any local LLM — configure once with mempalace llm setup (Ollama, LM Studio, vLLM, or any OpenAI-compatible endpoint). No cloud calls, no API key. Disable entirely for zero-overhead saves.

3. AI-Independent Auto-Save Hook (hooks/mempal_save_hook.py)

The original hook asks the AI to save memories at intervals — which means it depends on the AI cooperating. We replaced it with a Python hook that:

  • Reads the transcript directly
  • Extracts memories via general_extractor.py (pure patterns, no LLM)
  • Saves to ChromaDB with hash-based dedup
  • Triggers a git sync in the background
  • Always outputs {} — never blocks the AI, never interrupts the conversation

Covers: decisions, preferences, milestones, problems, emotional notes.
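The never-block contract can be sketched as follows (event field names are assumptions; the real implementation is hooks/mempal_save_hook.py):

```python
import json

def run_hook(raw_event: str) -> str:
    """Process one Stop event; on any failure still return '{}' so the AI is never blocked."""
    try:
        event = json.loads(raw_event)
        transcript_path = event.get("transcript_path", "")
        # ... read transcript, pattern-extract memories, save with hash dedup ...
    except Exception:
        pass  # swallow everything: a failed save must never interrupt the conversation
    return "{}"
```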

4. Palace Sync (sync/SyncMemories.ps1)

The ChromaDB palace is ~860MB — too large for git. The sync system:

  1. Exports all drawer content to archive/drawers_export.json (~24MB)
  2. Commits and pushes the JSON to your private memory repo
  3. Runs automatically via Task Scheduler (Windows) or cron (macOS/Linux)

On a new machine: git clone <repo>, then mempalace mine archive/drawers_export.json → full palace restored.
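Step 1 reduces to a plain JSON dump. An illustrative sketch (the record fields here are assumptions, not the fork's actual export schema):

```python
import hashlib
import json

def export_drawers(drawers, path):
    """Write drawer content to a portable JSON file small enough to commit to git."""
    records = [
        {
            "id": d["id"],
            "wing": d["wing"],
            "room": d["room"],
            "content": d["content"],
            # Content hash supports dedup when the export is re-mined on another machine
            "sha256": hashlib.sha256(d["content"].encode("utf-8")).hexdigest(),
        }
        for d in drawers
    ]
    with open(path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)
    return len(records)
```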


Quick Start

Windows (one-shot installer)

git clone https://github.com/Perseusxrltd/mempalace
cd mempalace
pip install .

# Sets up hooks, Task Scheduler sync, vLLM auto-start, backfills trust records
powershell -ExecutionPolicy Bypass -File sync\install_windows.ps1

Then add the MCP server:

claude mcp add mempalace -- python -m mempalace.mcp_server

Restart Claude Code. In your first conversation, call mempalace_status — it loads the palace overview and teaches the AI the AAAK dialect automatically.

Manual / macOS / Linux

pip install .

# Mine a project or conversation history
mempalace init ~/projects/myapp
mempalace mine ~/projects/myapp

# Add MCP server
claude mcp add mempalace -- python -m mempalace.mcp_server

# Install the auto-save hook (add to .claude/settings.local.json)
# See hooks/README.md for full instructions

# Backfill trust records for existing drawers
python sync/backfill_trust.py

LLM backend (contradiction detection — optional)

Contradiction detection works with any local LLM. Configure it interactively:

mempalace llm setup
  1. None (disabled)    — no conflict detection, saves instantly
  2. Ollama             — local, easy: ollama pull gemma2
  3. LM Studio          — local GUI with model browser
  4. vLLM               — local, fast, needs GPU (WSL/Linux)
  5. Custom             — any OpenAI-compatible endpoint

Check and test at any time:

mempalace llm status   # show config + ping
mempalace llm test     # send a test prompt
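Any OpenAI-compatible server answers GET /v1/models, so a status ping can be very small. A sketch of what such a check might do (the endpoint path is the OpenAI convention; what the CLI actually does is an assumption):

```python
import json
import urllib.request

def ping(base_url: str, timeout: float = 3.0) -> bool:
    """Return True if an OpenAI-compatible server responds to /v1/models."""
    try:
        url = base_url.rstrip("/") + "/v1/models"
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status == 200 and "data" in json.load(resp)
    except (OSError, ValueError):
        return False  # connection refused, timeout, or non-JSON body
```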

vLLM on WSL (for GPU users):

cp sync/run_vllm.sh ~/run_vllm.sh
bash ~/run_vllm.sh &
# ~60s to load, then: mempalace llm setup → choose vllm → http://localhost:8000

The Windows installer registers vLLM as a Task Scheduler task that starts on login.


MCP Tools

The MCP server exposes 24 tools across five categories.

Read

| Tool | What it does |
| --- | --- |
| mempalace_status | Palace overview — drawer counts, wing breakdown, AAAK spec |
| mempalace_list_wings | All wings with drawer counts |
| mempalace_list_rooms | Rooms within a wing |
| mempalace_get_taxonomy | Full wing → room → count tree |
| mempalace_get_aaak_spec | Get the AAAK compressed memory dialect spec |
| mempalace_search | Hybrid search (vector + lexical RRF). Filters out superseded memories. Flags contested with ⚠. Optional min_similarity threshold. |
| mempalace_check_duplicate | Check if content already exists before filing |

Write

| Tool | What it does |
| --- | --- |
| mempalace_add_drawer | File content into a wing/room. Creates trust record + spawns background contradiction detection |
| mempalace_delete_drawer | Soft-delete a drawer (trust record marked historical, never hard-removed) |

Knowledge Graph

| Tool | What it does |
| --- | --- |
| mempalace_kg_query | Query entity relationships with optional temporal filter |
| mempalace_kg_add | Add a typed fact (subject → predicate → object, with valid_from) |
| mempalace_kg_invalidate | Mark a fact as no longer true |
| mempalace_kg_timeline | Chronological fact history for an entity |
| mempalace_kg_stats | Knowledge graph overview |
| mempalace_traverse | Walk the palace graph from a room — find connected ideas |
| mempalace_find_tunnels | Rooms that bridge two wings |
| mempalace_graph_stats | Graph topology overview |

Trust

| Tool | What it does |
| --- | --- |
| mempalace_trust_stats | Trust layer overview — counts by status, avg confidence, pending conflicts |
| mempalace_verify | Confirm a drawer is accurate (+0.05 confidence) |
| mempalace_challenge | Flag a drawer as suspect (−0.1 confidence, marks contested) |
| mempalace_get_contested | List unresolved contested memories for review |
| mempalace_resolve_contest | Manually pick the winner of a conflict |
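The verify/challenge deltas are simple clamped adjustments. A minimal sketch, assuming confidence is clamped to [0, 1]:

```python
def verify(confidence: float) -> float:
    """Confirming a drawer nudges confidence up by 0.05, capped at 1.0."""
    return min(1.0, confidence + 0.05)

def challenge(confidence: float) -> float:
    """Challenging a drawer drops confidence by 0.1, floored at 0.0."""
    return max(0.0, confidence - 0.1)
```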

Agent Diary

| Tool | What it does |
| --- | --- |
| mempalace_diary_write | Write a diary entry in AAAK format — agent's personal journal |
| mempalace_diary_read | Read recent diary entries |

Auto-Save Hooks

Two hooks are included. Use the Python hook for always-on extraction; combine with the shell PreCompact hook for deep saves before context compaction.

Python hook (recommended — never blocks):

{
  "hooks": {
    "Stop": [{
      "matcher": "*",
      "hooks": [{
        "type": "command",
        "command": "python3 /path/to/hooks/mempal_save_hook.py",
        "timeout": 15
      }]
    }]
  }
}

See hooks/README.md for full installation, Codex CLI setup, and configuration options.


Palace Sync

Automatic hourly backup to a private git repo. Works across machines.

Setup (Windows):

# Copy sync script
Copy-Item sync/SyncMemories.ps1 $env:USERPROFILE\.mempalace\

# Schedule hourly sync
$action  = New-ScheduledTaskAction -Execute "powershell.exe" `
    -Argument "-NonInteractive -WindowStyle Hidden -File $env:USERPROFILE\.mempalace\SyncMemories.ps1"
$trigger = New-ScheduledTaskTrigger -RepetitionInterval (New-TimeSpan -Hours 1) -Once -At (Get-Date)
Register-ScheduledTask -TaskName "MemPalaceMemorySync" -Action $action -Trigger $trigger -RunLevel Highest -Force

Restore on new machine:

git clone https://github.com/YOUR_USERNAME/personal-ai-memories ~/.mempalace
cd ~/.mempalace
python -m mempalace mine archive/drawers_export.json
python ~/.mempalace/backfill_trust.py

See sync/README.md for full details including macOS/Linux cron setup.


Architecture

User → CLI → miner/convo_miner ─────────────────┐
                                                  ↓
                                        ChromaDB palace (vectors)
                                        FTS5 mirror (lexical)
                                        drawer_trust (status/confidence)
                                                  ↕
Auto-save hook → general_extractor ──────────────┘
                                         ↑ trust.create()
                                         ↑ contradiction_detector (background thread)
                                                  ↕
MCP Server → hybrid_searcher → trust-filtered, confidence-weighted results
           → kg tools        → entity facts, temporal queries
           → trust tools     → verify / challenge / resolve
           → diary           → agent journal
                                                  ↕
Task Scheduler → SyncMemories.ps1 → archive/drawers_export.json → git push

Storage layout:

~/.mempalace/
├── palace/                   ← ChromaDB (vectors, ~860MB, git-ignored)
├── knowledge_graph.sqlite3   ← KG triples + FTS5 + trust tables (git-ignored)
├── archive/
│   └── drawers_export.json   ← portable JSON export (~24MB, committed to git)
├── hooks/
│   └── mempal_save_hook.py   ← Python auto-save hook
└── SyncMemories.ps1          ← hourly sync script

Benchmarks

Benchmarks and a full reproduction suite are in /benchmarks and /eval.

# Reproduce the RRF benchmark
python eval/benchmark.py

# Full LongMemEval benchmark (500 questions)
python benchmarks/longmemeval_bench.py /path/to/longmemeval_s_cleaned.json

The upstream project's 96.6% R@5 on LongMemEval (raw mode) is real and independently reproduced. AAAK mode trades ~12 points of recall for token density — use raw mode for maximum accuracy.
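For reference, the two reported retrieval metrics are computed roughly like this (a generic sketch, not the repo's eval code):

```python
def mrr(rankings, gold):
    """Mean Reciprocal Rank: average of 1/rank of the gold answer (0 if absent)."""
    total = 0.0
    for ranking, target in zip(rankings, gold):
        if target in ranking:
            total += 1.0 / (ranking.index(target) + 1)
    return total / len(gold)

def hit_at_1(rankings, gold):
    """Fraction of queries whose top-ranked result is the gold answer."""
    return sum(r[0] == t for r, t in zip(rankings, gold)) / len(gold)
```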


A Note from the Original Authors

See the honest README correction from Milla Jovovich & Ben Sigman for context on the original project's benchmark claims and corrections.


Changelog

v3.2.0 — Community Fixes

Eight upstream bugs fixed, sourced from the milla-jovovich/mempalace community:

| Fix | Impact |
| --- | --- |
| Widen chromadb to <2.0 | Python 3.14 compatibility |
| Add hnsw:space=cosine on all collection creates | Similarity scores were negative L2 values, not cosine. All new palaces fixed automatically. Existing palaces benefit after mempalace repair. |
| Guard results["documents"][0] on empty queries | ChromaDB 1.x returns {documents:[]} on empty results; was crashing with IndexError |
| Redirect sys.stdout → sys.stderr at MCP import | chromadb/posthog startup chatter was corrupting the JSON-RPC wire, causing Unexpected token errors in clients |
| Paginate taxonomy/list tools | Palaces with >10k drawers were silently truncated at 10k; now pages through all drawers |
| Drop wait_for_previous arg | Gemini MCP clients inject this undocumented arg; was crashing with TypeError |
| min_similarity on mempalace_search | Results below threshold are omitted — gives agents a clean "nothing found" signal instead of returning negative-score noise |
| CODE_KEYWORDS blocklist in entity detector | Rust types, React, framework names (String, Vec, Debug, React...) were being detected as entities during mempalace init |
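The CODE_KEYWORDS fix reduces to a set-membership filter. An illustrative sketch with only the entries the changelog names; the real blocklist in the entity detector is longer:

```python
# Subset of blocked identifiers named in the changelog (real list is larger).
CODE_KEYWORDS = {"String", "Vec", "Debug", "React"}

def filter_entities(candidates):
    """Drop language/framework identifiers that are not real-world entities."""
    return [c for c in candidates if c not in CODE_KEYWORDS]
```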

v3.1.0 — Trust Layer + LLM Backend

  • Memory trust lifecycle: current → superseded | contested → historical
  • Two-stage background contradiction detection (Stage 1: fast LLM judge; Stage 2: palace-context enriched)
  • Pluggable LLM backend: Ollama, LM Studio, vLLM, custom OpenAI-compatible, or none — configure with mempalace llm setup
  • Resource-throttled detection: nice -n 19, ionice -c 3, 2-minute global cooldown, 5s inter-request sleep
  • One-shot Windows installer (sync/install_windows.ps1) — sets up hooks, Task Scheduler, optional vLLM auto-start
  • 5 new trust MCP tools: trust_stats, verify, challenge, get_contested, resolve_contest

License

MIT — see LICENSE.

Source status: Live
Last checked: Apr 9, 2026
Trust tier: Linked

Technical Metadata

Tags
ai · memory · rag · hybrid-search · information-retrieval · python · local-first

Evolution

No evolution events logged yet.

Safety Notice

Artifacts registered on MoltHub are hosted elsewhere. Review the source, maintainer context, and repository evidence before you rely on them.