Karpathy LLM Wiki pattern#

Origin: Originally written in Hungarian as part of MyForge Vault 11.11 — Superintelligent Vault project. Source: Karpathy-LLM-Wiki-pattern (Hungarian version).

A minimal RAG pattern published by Andrej Karpathy in April 2026. Core idea: instead of classic retrieval (vector DB, embeddings, runtime search), the LLM incrementally compiles knowledge into a structured wiki that accumulates (compounds) over time.

The three layers#

Layer	Purpose	Our equivalent
raw/	Immutable source — articles, transcripts, chat dumps, screenshot readings. The LLM reads but never modifies.	10-raw/
wiki/	Distilled knowledge, rewritten in own words. The LLM writes here — concepts, playbooks, glossary. Linked entries, consistent structure.	11-wiki/
agent scratchpad	Speculative notes, unorganized drag-drops. Unvalidated content allowed.	(not yet separated — `08-Sessions/` currently fills this role)

Compilation > Retrieval#

Classic RAG: query → embed → search → top-k chunks → answer.

Karpathy: query → read wiki-index.md → drill into wiki pages → answer.

No vector DB. No embeddings. Index.md is the map; the semantic structure lives in the wiki files.

Crystallization workflow#

At the end of a working session (research thread, debug session, analysis), the LLM: 1. Re-reads the relevant 08-Sessions/ file 2. Produces an 11-wiki/ digest containing durable lessons 3. If a new concept emerged, writes a separate wiki entry 4. Leaves the 08-Sessions/ file raw — as reference

Our /11.11stop command already does this halfway: Summary + Learnings + Next session sections in the session file. The full Karpathy pattern further propagates the Learning bullets into separate 11-wiki/ entries.

Operative components (Karpathy minimum stack)#

CLAUDE.md / AGENTS.md = the "schema-brain" — entity types, page formats, workflow rules. The LLM reads this first.
index.md = the "map" — one per folder (Projects/Index.md, Hosts/Index.md, wiki/Index.md). The LLM uses these to navigate after the user's question.
No vector DB — index + drill-down handles "hundreds of pages" (~200-300 files).

What we adopt, what we don't#

Karpathy's pattern	Our vault
`raw/` as immutable source	✅ 10-raw/ exists
`wiki/` as distillate	✅ 11-wiki/ exists (being developed)
Agent scratchpad separated	🟡 Partial — `08-Sessions/` fills this, not cleanly separated
index.md per folder	🟡 02-Projects/Index, 03-Hosts/Index, 06-Audits/Index, 10-raw/Index, 11-wiki/Index exist — being expanded
Crystallization workflow	🟡 `/11.11stop` halfway — expandable with wiki propagation
CLAUDE.md schema-brain	✅ AGENTS

Production validation — GenericAgent L0-L4 parallel#

A 10.7k★ Chinese self-evolving agent framework, lsdefine/GenericAgent (arXiv 2604.17091), chose the same 5-layer Karpathy pattern at production grade — two independent projects converged on this:

GenericAgent	Our vault
L0 — Meta Rules (base behaviour + system constraints)	00-Meta/ (Tag-taxonomy, Frontmatter-schema, AGENTS.md)
L1 — Insight Index (minimal index, fast routing/recall)	02-Projects/Index, `MEMORY.md`
L2 — Global Facts (long-term stable knowledge)	05-Memory/User, 05-Memory/Infrastructure
L3 — Task Skills / SOPs (reusable workflows)	11-wiki/ evergreen playbooks
L4 — Session Archive (distilled records of completed tasks)	08-Sessions/

9 atomic tools ≈ our stack:

code_run, file_read/write/patch ≈ Bash, Read, Write, Edit
web_scan, web_execute_js ≈ WebFetch, WebSearch
ask_user ≈ AskUserQuestion
update_working_checkpoint, start_long_term_update ≈ session note + session-close crystallization

Key difference: autonomy level. GenericAgent does autonomous skill-growth (auto-crystallize after every task), our system is human-in-the-loop (batch preview + user approval at session close). The memory structure is the same, the autonomy level differs.

Takeaway: if you ever build a PaaS-style agent system, the 9-atomic-tool + L0-L4 layer pattern is a solid starting skeleton.

Audio overview#

EN narration (Charon voice): [[.vault-nb/audio-overviews/Karpathy-LLM-Wiki-pattern.en.mp3]]
HU narration (Kore voice): [[.vault-nb/audio-overviews/Karpathy-LLM-Wiki-pattern.hu.mp3]]

Generated via Gemini 3.1 Flash TTS preview. ~1-2 minutes each. See gemini-3-1-flash-tts-pipeline for the pipeline.