Claude reads entire files to find what it needs. Lumen gives it a map.

Lumen is a 100% local semantic code search engine for AI coding agents. No API keys, no cloud, no external database, just open-source embedding models (Ollama or LM Studio), SQLite, and your CPU. A single static binary and your own local embedding server.

The payoff is measurable and reproducible: across 9 benchmark runs on 9 languages and real GitHub bug-fix tasks, Lumen cuts cost in every single language — up to 39%. Output tokens drop by up to 66%, sessions complete up to 53% faster, and patch quality is maintained in every task. All verified with a transparent, open-source benchmark framework that you can run yourself.

	With Lumen	Baseline (no Lumen)
Cost (avg, bug-fix)	$0.29 (-26%)	$0.40
Time (avg, bug-fix)	125s (-28%)	174s
Output tokens (avg)	5,247 (-37%)	8,323
JavaScript (marked)	$0.32, 119s (-33%, -53%)	$0.48, 255s
Rust (toml)	$0.38, 204s (-39%, -34%)	$0.61, 310s
PHP (monolog)	$0.14, 34s (-27%, -34%)	$0.19, 52s
TypeScript (commander)	$0.14, 56s (-27%, -33%)	$0.19, 84s
Svelte (chat-ui)	$0.10, 56s (-26%, -31%)	$0.14, 80s
Patch quality	Maintained in all 9 tasks	—

Demo
Quick start
What you get
How it works
Benchmarks
Supported languages
Configuration
- Supported embedding models
Controlling what gets indexed
Database location
CLI Reference
Troubleshooting
Development

Demo

Claude Code asking about the Prometheus codebase. Lumen's semantic_search finds the relevant code without reading entire files.

Quick start

Prerequisites:

Platform support: Linux, macOS, and Windows. File locking for background indexing coordination uses flock(2) on Unix and LockFileEx on Windows (via gofrs/flock).

Ollama installed and running, then pull the default embedding model:
```
ollama pull ordis/jina-embeddings-v2-base-code
```
One of: Claude Code, Cursor, Codex, or OpenCode

Note: Installation differs by platform. Claude Code is installed from a plugin marketplace. Codex uses a local MCP server plus native skill discovery. OpenCode installs from npm. Cursor packaging is shipped in this repository and is ready for Cursor's plugin distribution workflow.

Install:

Claude Code

/plugin marketplace add ory/claude-plugins
/plugin install lumen@ory

Verify by starting a new Claude session and running /lumen:doctor.

Cursor

Lumen ships a native Cursor plugin bundle in this repository:

.cursor-plugin/plugin.json - plugin manifest
mcp.json - local lumen MCP server wiring
hooks/hooks-cursor.json - SessionStart hook
skills/ - shared doctor and reindex skills

Use Cursor's plugin installation or distribution workflow with this bundle. Detailed packaging notes: .cursor-plugin/INSTALL.md

Verify by opening a new Cursor agent session and asking it to use the doctor skill or the Lumen semantic_search tool.

Codex

Quick install:

Fetch and follow instructions from https://raw.githubusercontent.com/ory/lumen/refs/heads/main/.codex/INSTALL.md

Manual install:

CODEX_HOME="${CODEX_HOME:-$HOME/.codex}"
git clone https://github.com/ory/lumen.git "$CODEX_HOME/lumen"
mkdir -p "$HOME/.agents/skills"
ln -s "$CODEX_HOME/lumen/skills" "$HOME/.agents/skills/lumen"
codex mcp add lumen -- "$CODEX_HOME/lumen/scripts/run" stdio

Detailed docs: .codex/INSTALL.md

Verify with:

codex mcp get lumen
ls -la "$HOME/.agents/skills/lumen"

OpenCode

Add @ory/lumen-opencode to the plugin array in your opencode.json:

{
  "plugin": ["@ory/lumen-opencode"]
}

Detailed docs: .opencode/INSTALL.md

Verify with:

opencode mcp list

Updating

Claude Code - update through Claude's plugin marketplace
Cursor - refresh or reinstall the bundled plugin through Cursor after updating this repository or the published package
Codex - cd "${CODEX_HOME:-$HOME/.codex}/lumen" && git pull
OpenCode - update the version pin in opencode.json (e.g. @ory/lumen-opencode@0.0.29) and restart OpenCode

On first Claude Code or Cursor session start, Lumen:

Downloads the binary automatically from the latest GitHub release
Indexes your project in the background using Merkle tree change detection
Registers a semantic_search MCP tool that the host can use automatically

In Codex and OpenCode, the same binary download and index seeding happen on the first semantic_search call.

Two shared skills are also available: doctor (health check) and reindex (forced re-indexing). Claude exposes them as /lumen:doctor and /lumen:reindex; the other hosts discover the same shared skill content through their native skill systems.

The same semantic_search, health_check, and index_status MCP tools plus the shared doctor and reindex skills are exposed through the Codex, Cursor, and OpenCode surfaces as well. The first semantic_search call seeds or refreshes the index automatically.

What you get

Semantic vector search — Claude finds relevant functions, types, and modules by meaning, not keyword matching
Auto-indexing — indexes on session start, only re-processes changed files via Merkle tree diffing
Incremental updates — re-indexes only what changed; large codebases re-index in seconds after the first run
12 language families — Go, Python, TypeScript, JavaScript, Svelte, Rust, Ruby, Java, PHP, C/C++, C#, Dart
Git worktree support — worktrees share index data automatically; a new worktree seeds from a sibling's index and only re-indexes changed files, turning minutes of embedding into seconds
Zero cloud — embeddings stay on your machine; no data leaves your network
Ollama and LM Studio — works with either local embedding backend

How it works

Lumen sits between your codebase and Claude as an MCP server. When a session starts, it walks your project and builds a Merkle tree over file hashes: only changed files get re-chunked and re-embedded. Each file is split into semantic chunks (functions, types, methods) using Go's native AST or tree-sitter grammars for other languages. Chunks are embedded and stored in SQLite + sqlite-vec using cosine-distance KNN for retrieval.

Files → semantic chunks → vector embeddings → SQLite/sqlite-vec → KNN search

When Claude needs to understand code, it calls semantic_search instead of reading entire files. The index is stored outside your repo (~/.local/share/lumen/<hash>/index.db), keyed by project path and model name — different models never share an index.

Benchmarks

Lumen is evaluated using bench-swe: a SWE-bench-style harness that runs Claude on real GitHub bug-fix tasks and measures cost, time, output tokens, and patch quality — with and without Lumen. All results are reproducible: raw JSONL streams, patch diffs, and judge ratings are committed to this repository.

Key results — 9 runs across 9 languages, hard difficulty, real GitHub issues (ordis/jina-embeddings-v2-base-code, Ollama):

Language	Cost Reduction	Time Reduction	Output Token Reduction	Quality
Rust	-39%	-34%	-31% (18K → 12K)	Poor (both)
JavaScript	-33%	-53%	-66% (14K → 5K)	Perfect (both)
TypeScript	-27%	-33%	-64% (5K → 1.8K)	Good (both)
PHP	-27%	-34%	-59% (1.9K → 0.8K)	Good (both)
Ruby	-24%	-11%	-9% (6.1K → 5.6K)	Good (both)
Python	-20%	-29%	-36% (1.7K → 1.1K)	Perfect (both)
Go	-12%	-9%	-10% (11K → 10K)	Good (both)
C++	-8%	-3%	+42% (feature task)	Good (both)
Svelte	-26%	-31%	-26% (4.0K → 3.0K)	Poor (both)

Cost was reduced in every language tested. Quality was maintained in every task — zero regressions. JavaScript and TypeScript show the most dramatic efficiency gains: same quality fixes in half the time with two-thirds fewer tokens. Even on tasks too hard for either approach (Rust, Svelte), Lumen cuts the cost of failure by 26–39%.

See docs/BENCHMARKS.md for all 9 per-language deep dives, judge rationales, and reproduce instructions.

Supported languages

Supports 12 language families with semantic chunking (10 benchmarked):

Language	Parser	Extensions	Benchmark status
Go	Native AST	`.go`	Benchmarked: -12% cost, Good quality
Python	tree-sitter	`.py`	Benchmarked: Perfect quality, -36% tokens
TypeScript / TSX	tree-sitter	`.ts`, `.tsx`	Benchmarked: -64% tokens, -33% time
JavaScript / JSX	tree-sitter	`.js`, `.jsx`, `.mjs`	Benchmarked: -66% tokens, -53% time
Dart	tree-sitter	`.dart`	Benchmarked: -76% cost, -82% tokens, -79% time
Rust	tree-sitter	`.rs`	Benchmarked: -39% cost, -34% time
Ruby	tree-sitter	`.rb`	Benchmarked: -24% cost, -11% time
PHP	tree-sitter	`.php`

…

Lumen

Installation

How to install

README

Table of contents