Long-term memory for your AI

Capture every conversation. Distill durable knowledge. Recall compressed context on the next call.

IN ACTION

See what it looks like

Real screenshots from a running Rohrpost instance — the MCP integration your AI sees, and the knowledge graph you control.

Claude MCP panel showing rohrpost connected with 11 tools listed
MCP Integration

Claude sees 11 rohrpost tools the moment it connects — ingest, recall, graph traversal, OKF export, and more. Zero custom code required.

Rohrpost admin panel showing the Knowledge Graph canvas with 56 entities and colored nodes
Knowledge Graph

The admin panel renders your entire knowledge graph as an interactive canvas. Drill into any entity, trace relationships, and spot gaps at a glance.

FEATURES

Everything you need for AI memory

Six core capabilities that turn ephemeral AI conversations into persistent, searchable knowledge — deployed on your infrastructure.

On-Prem First

Runs on a single VM or a full cluster via Docker Compose or Helm. Your data never leaves your network — no cloud provider lock-in, no third-party data processing agreements required.

Knowledge Graph

Entities and relationships are extracted into an Apache AGE graph database. This enables multi-hop reasoning across conversations, surfacing connections that flat vector search misses entirely.

Hybrid Retrieval

Combines vector similarity (pgvector), keyword matching (BM25 via Tantivy), and graph traversal into a single query. Results are merged using Reciprocal Rank Fusion for consistently relevant recall.

MCP Native

Ships a Model Context Protocol server exposing four tools — ingest, recall, graph, and forget. Any MCP-compatible client (Claude, Cursor, VS Code) can read and write your knowledge base without custom integration code.

Stream Everything

All data flows through NATS JetStream with at-least-once delivery guarantees. Capsules are durably stored and replayable, so no interaction is ever lost even during downstream outages.

Built in Rust

A single static binary (~30 MB) with no runtime dependencies. Compiles to native code for sub-millisecond proxy overhead and runs comfortably on a 1 vCPU / 1 GB RAM instance.

✦ New

Graph Explorer

A visual entity and relationship browser built into the admin panel. Navigate the knowledge graph with breadcrumb trails, drill into neighborhoods, and trace multi-hop connections across all your ingested conversations.

Skills Registry

Agents register their available tools and capabilities into Rohrpost. The registry indexes what's callable across your fleet, so recall surfaces not just knowledge but also the right tool to act on it.

Open Knowledge Format

Export and import your entire knowledge base in a portable, standardized format. Migrate between instances, share curated datasets with teammates, or archive conversations for compliance — without vendor lock-in.

HOW IT WORKS

From conversation to context in four steps

A durable pipeline that captures every AI interaction and transforms it into compressed, retrievable knowledge.

1

Capture

Your AI interactions produce capsules — prompts, responses, diffs, and errors — streamed durably via NATS JetStream with at-least-once delivery.

2

Transport

Capsules flow through the tube to the distiller. NATS JetStream guarantees delivery with full replay capability for reprocessing or auditing.

3

Distill

The distiller chunks text, generates vector embeddings via pgvector, builds BM25 indexes with Tantivy, and extracts entities into an Apache AGE knowledge graph.

4

Recall

On the next LLM call, hybrid retrieval (vector + BM25 + graph walk) merges results via Reciprocal Rank Fusion into a token-budgeted brief injected as context.

USE CASES

Built for every scale

Whether you are a solo developer or an enterprise team, Rohrpost adapts to your workflow and infrastructure requirements.

Solo Developers

The Problem

You repeat the same context to your AI assistant every session. Previous conversations vanish, forcing you to re-explain architecture decisions, coding patterns, and project constraints from scratch.

The Solution

Rohrpost captures every interaction and recalls relevant context automatically. Persistent memory across sessions saves 40% on token costs by eliminating redundant context windows.

Development Teams

The Problem

Knowledge lives in individual chat histories that teammates cannot access. Onboarding new developers means re-discovering decisions that were already made and explained to an AI months ago.

The Solution

Shared knowledge graph lets any team member query the collective AI memory. The OpenAI-compatible proxy integrates without code changes — existing applications work immediately.

Enterprises

The Problem

Sensitive IP flows through third-party AI providers with no audit trail. Compliance teams cannot verify what data was shared or how AI responses influenced production decisions.

The Solution

Self-hosted deployment keeps all data on-premise with a full audit trail for compliance. Every capsule is durably stored and replayable, providing complete observability into AI interactions.

DEPLOYMENT

Deploy in under a minute

A single static binary (~30 MB) that runs on 1 vCPU and 1 GB RAM. No JVM, no runtime dependencies, no warm-up time.

Single VM

Docker Compose on any Linux box. One command to production.

Learn more →

Kubernetes

Helm chart for production clusters with horizontal scaling.

Learn more →

Managed / SaaS

Hosted Rohrpost with zero ops at app.rohrpost.io. Focus on building, not infrastructure.

Learn more →
docker compose up