Thoth

"Thoth, scribe of the gods, keeper of knowledge."

Long-term memory for Claude Code agents. Rust-native, code-aware, zero API required.

English · Tieng Viet

Warning

Work in progress — not production-ready. APIs, on-disk formats, and CLI flags may change without notice. Do not rely on it for production workloads yet.

What it is

Thoth gives a Claude Code agent persistent, disciplined memory of a codebase. It parses source with tree-sitter, builds a symbol graph (calls, imports, extends, references), indexes everything with BM25 (tantivy) and RRF fusion, and enforces a "recall before write" gate so the agent consults memory before mutating code.

Three binaries:

thoth — CLI: setup, index, query, eval, memory ops.
thoth-mcp — MCP stdio server (41 tools, 3 prompts, 2 resources).
thoth-gate — PreToolUse hook: blocks writes without prior recall.

thoth setup wires everything — hooks, MCP server, skills, config — in one command.

Nothing leaves your machine unless you opt in.

Install

npx @unknownstudio/thoth        # downloads binary + runs setup wizard

Other channels:

brew install unknown-studio-dev/thoth/thoth
# or
cargo install --git https://github.com/unknown-studio-dev/thoth thoth-cli thoth-mcp
thoth setup

Quickstart

thoth index .                                    # build code index
thoth query "how does the gate work"             # hybrid recall
thoth impact "module::symbol" --direction up     # blast radius
thoth memory fact "tokens expire after 15m" --tags auth

Inside Claude Code, the gate and skills work automatically after setup.

Benchmarks

All numbers from this repo with the commands below. Machine: MacBook Pro 14" (Nov 2023), Apple M3 Pro, 18 GB RAM, release build. Corpus: Thoth's own source tree (109 Rust files, ~47 k LoC). Mode::Zero only (no embedding, no LLM calls).

Recall accuracy (seeded gold set, 10 queries over facts + lessons + code):

cargo test -p thoth-retrieve --test recall_accuracy -- --nocapture

Metric	Value
R@5	100 % (10/10)
R@3	100 % (10/10)
Target	R@5 >= 80 %, R@3 >= 60 %

The test seeds 8 facts, 5 lessons, and 3 Rust source files into a temp store, then runs 10 natural-language queries and asserts each finds the expected substring in the top-k results.

Eval on Thoth's own source tree (8 gold queries):

thoth eval --gold eval/gold.toml -k 8

Metric	Value
P@8	100 % (8/8)
MRR	0.771
Latency p50	75 ms
Latency p95	90 ms

graph_bfs microbenchmark (Criterion):

cargo bench -p thoth-store --bench graph_bfs

Direction	Start	Median
Out	root	1.73 ms
In	deepest leaf	13.5 us
Both	deepest leaf	501 us

Synthetic 4-ary tree (~341 nodes, 5 levels), BFS depth 8. The In direction benefits from the reverse-edge index (edges_by_dst).

Memory

Four working memory kinds, one store:

Kind	Storage	What
Semantic	`graph.redb` + `fts.tantivy/`	Symbols, calls, imports, references (tree-sitter)
Episodic	`episodes.db` (SQLite FTS5)	Every query, outcome, event — timeline for reflect
Reflective	`LESSONS.md`	Lessons from mistakes, confidence-scored, auto-quarantined
Domain	`domain/<ctx>/`	Business rules synced from Notion / Asana / local files

Facts live in MEMORY.md, preferences in USER.md.

Gate (search-before-write)

thoth-gate runs on every Write/Edit/Bash PreToolUse and decides from three factors: intent (read-only Bash bypasses), recency (recent recall passes), relevance (edit tokens scored against recall history). Mode: off / nudge (default) / strict.

Reflection debt (mutations - remembers) adds a second enforcement loop: nudge at 10, hard block at 20. Tunable in config.toml.

Knowledge graph

Temporal entity-relationship triples with validity windows — add, query, invalidate, timeline — backed by SQLite. Available via MCP tools (thoth_kg_*) and CLI.

MCP server

41 tools covering recall, memory CRUD, graph analysis, knowledge graph, overrides, workflows, and conversation archive. Plus 3 prompts (thoth.nudge, thoth.reflect, thoth.grounding_check) and 2 resources (thoth://memory/MEMORY.md, thoth://memory/LESSONS.md).

Run thoth --help or see the tool table in CLAUDE.md.

Background review & compact

thoth review — LLM-driven session review, spawned automatically by PostToolUse hook. Builds context from event logs (~1k tokens), not full conversation. Uses claude-haiku-4-5 by default.

thoth compact — merges near-duplicate facts/lessons. Preview with --dry-run. Backs up originals before overwriting.

Domain memory

Business rules synced from external sources via thoth domain sync. Feature-gated adapters:

Adapter	Feature	Auth
`file`	always on	--
`notion`	`notion`	`NOTION_TOKEN`
`asana`	`asana`	`ASANA_TOKEN`
`notebooklm`	`notebooklm`	-- (stub; export to file)

CLI cheatsheet

thoth setup                            # interactive wizard
thoth index .                          # parse + index
thoth watch .                          # stay resident, reindex on change
thoth query "nudge flow"               # hybrid recall

thoth impact  "mod::sym" -d 3          # blast radius
thoth context "mod::sym"               # 360 symbol view
thoth changes                          # git diff HEAD -> touched symbols

thoth memory show                      # read MEMORY.md + LESSONS.md
thoth memory fact "..." --tags x,y     # append fact
thoth memory lesson --when "..." "..." # append lesson
thoth memory forget                    # TTL + quarantine pass

thoth review                           # LLM session review
thoth compact --dry-run                # preview memory consolidation

thoth domain sync --source file --from ./specs/
thoth eval --gold eval/gold.toml -k 8  # precision@k eval
thoth install                          # wire Claude Code (hooks+MCP+skills)
thoth uninstall                        # remove wiring

thoth --help for the full surface.

Embedding as a library

use thoth_core::Query;
use thoth_parse::LanguageRegistry;
use thoth_retrieve::{Indexer, Retriever};
use thoth_store::StoreRoot;

let store = StoreRoot::open(".thoth").await?;
Indexer::new(store.clone(), LanguageRegistry::new())
    .index_path(".")
    .await?;

let r = Retriever::new(store);
let hits = r.recall(&Query::text("token refresh logic")).await?;

Requirements

Rust >= 1.91 (for building from source)
Git >= 2.30

No API key required for Mode::Zero. For Mode::Full, set ANTHROPIC_API_KEY / VOYAGE_API_KEY as needed.

Contributing

See CONTRIBUTING.md.

Status

Alpha. Core is working: parse, store, graph, retrieve, CLI, MCP, gate, reflection debt, background review, domain sync, knowledge graph, conversation archive. On-disk format may still change.

License

Dual-licensed: Apache 2.0 or MIT.

Name		Name	Last commit message	Last commit date
Latest commit History 75 Commits
.github		.github
crates		crates
docs		docs
eval		eval
packaging		packaging
scripts		scripts
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.thothignore		.thothignore
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
Makefile		Makefile
README.md		README.md
README.vi.md		README.vi.md
RESEARCH.md		RESEARCH.md
SECURITY.md		SECURITY.md
rust-toolchain.toml		rust-toolchain.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Thoth

What it is

Install

Quickstart

Benchmarks

Memory

Gate (search-before-write)

Knowledge graph

MCP server

Background review & compact

Domain memory

CLI cheatsheet

Embedding as a library

Requirements

Contributing

Status

License

About

Licenses found

Uh oh!

Releases 4

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Thoth

What it is

Install

Quickstart

Benchmarks

Memory

Gate (search-before-write)

Knowledge graph

MCP server

Background review & compact

Domain memory

CLI cheatsheet

Embedding as a library

Requirements

Contributing

Status

License

About

Topics

Resources

License

Licenses found

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages