Skip to content

Three Modes of LLM Knowledge Bases: Knowledge Vault, Experience Vault, and Blog

Apr 3, 2026 1 min
TL;DR Andrej Karpathy proposed a framework for compiling personal knowledge wikis with LLMs — collect raw data, have the LLM compile it into .md wiki pages, run Q&A against the wiki, and file outputs back. This post compares three practical approaches: Karpathy's knowledge vault model, the community's experience vault model, and quidproquo's blog model.

🌏 中文版

Andrej Karpathy recently shared his approach to building a personal knowledge base with LLMs on X, sparking widespread discussion. His core thesis: instead of spending tokens manipulating code, spend them manipulating knowledge. This post summarizes his framework, community extensions, and the different path quidproquo.cc has taken in the same direction.

Karpathy’s Framework: LLM as Knowledge Compiler

Karpathy’s approach breaks down into five stages:

Data Ingest → Compile → Q&A → Output → Linting

  1. Data Collection: Place articles, papers, repos, datasets, and images into a raw/ directory; use Obsidian Web Clipper to capture web pages and download associated images locally
  2. LLM Compilation: The LLM incrementally compiles raw/ into a .md wiki — writing summaries, building backlinks, categorizing into concept articles, and cross-linking
  3. Q&A Queries: Once the wiki reaches a certain scale (~100 articles, ~400K words), query the wiki directly; the LLM self-maintains indices and summaries without needing additional RAG
  4. Output Filing: Output formats include markdown, Marp slides, and matplotlib charts; query results are filed back into the wiki for continuous knowledge accumulation
  5. Linting Health Checks: The LLM periodically scans the wiki, identifies inconsistencies, fills gaps, and suggests new article topics

The frontend uses Obsidian to browse all content. The key principle: humans almost never manually edit the wiki — the wiki is the LLM’s domain.

He concluded his thread by saying:

I think there is room here for an incredible new product instead of a hacky collection of scripts.

Community Extensions: From Knowledge Compilation to Continuous Evolution

Others have proposed more complete architectures building on Karpathy’s foundation, adding five dimensions he didn’t address.

Session Continuity: Persistence Across Conversations

Using Claude Code’s PreCompact / PostCompact hooks to automatically save state before context compression and restore it afterward. This solves the problem of “losing track of what I was doing when the conversation gets too long.”

Karpathy doesn’t need this because his knowledge lives in files, not dependent on the context window. But for people who use AI as a working partner, session continuity is a fundamental requirement. Open-source solutions like Continuous-Claude-v3, ContextVault, and claude-session-continuity-mcp already exist.

Voice Transcription: Signals That Text Can’t Capture

Beyond articles and code, incorporate voice conversations into the knowledge base. Using Typeless to record 28,000+ voice transcriptions, then Gemini Pro for intent classification. The finding: 97% of keyword-classified “praise” was misclassified — because text-based sentiment analysis lacks prosodic information: intonation, pauses, and speaking rate carry intent signals that text cannot capture.

Cross-AI Tool Sync: What One AI Learns, All AIs Know

The biggest pain point of using Claude Code, Codex, and Gemini CLI simultaneously is that knowledge doesn’t transfer between them. A bridge module was built to automatically extract high-confidence knowledge into each tool’s instruction files for synchronization.

This need has been validated by the community — open-source solutions include claude_code_bridge (multi-AI real-time collaboration), skillshare (one-click skill sync across all CLI tools), and gemini-context-bridge (automatic CLAUDE.md → GEMINI.md conversion).

Fine-Tuning: Karpathy Said “Want to Explore in the Future” — Others Are Already Doing It

Karpathy mentioned “synthetic data + finetuning” as a future direction at the end of his thread. The community has already run 16 generations of fine-tuning on Qwen3-14B using Unsloth + Google Colab, with each generation costing less than $1. Unsloth supports running on free Colab T4 GPUs, claiming 2x speed and 70% VRAM savings. The key design is having the system automatically track whether training materials are sufficient, triggering training only when enough has accumulated.

Confidence Decay: Knowledge Needs a Decay Mechanism

Wikis grow bloated over time. Adding confidence decay — knowledge unused for 90 days is automatically downweighted, and knowledge unused for six months is marked as outdated. But “lessons learned the hard way” (major mistakes) never decay.

This corresponds to the Ebbinghaus forgetting curve in cognitive science and has AI memory system implementations — Mnemex implements human-like forgetting curves with temporal memory, ZenBrain proposes a 7-layer memory architecture including FSRS spaced repetition and Bayesian confidence propagation.

Comparing the Three Models

All three approaches share the goal of “managing knowledge with LLMs,” but they’ve made different trade-offs:

DimensionKarpathy (Knowledge Vault)Community Extension (Experience Vault)quidproquo.cc (Blog Model)
Core PhilosophySpend tokens on knowledge operationsLet AI continuously evolve with each conversationLLM-compiled public knowledge system
Where Knowledge LivesLocal .md wiki directoryObsidian vault + AI memoryAstro .md + Cloudflare D1
Data SourcesArticles, papers, repos, imagesSame + voice transcriptions (Typeless)Conversations, external document scraping
Compilation FlowLLM incrementally compiles wikiAuto-scan and archive at session end/post skill converts conversations to structured articles
Indexing MethodLLM self-maintains index, no RAG neededNot specifically describedCloudflare Vectorize semantic indexing
Frontend BrowsingObsidian + Marp slidesObsidianAstro SSG public website
External Data HandlingObsidian Web ClipperNot specifically describedBrowser Rendering API → chunking → D1
Quality MaintenanceLLM health checksVault Governance (orphan notes, stale content)OpenSpec workflow lifecycle
Time DimensionNo decayConfidence decay (90-day downweight)No decay
Session ContinuityNot needed (knowledge in files)PreCompact/PostCompact hooksClaude Code hooks
Error LearningNo automatic mechanismError counter, 3 strikes upgrades to ruleNo automatic mechanism
Cross-Tool SyncSingle LLMBridge module syncs multiple toolsSingle toolchain
Fine-TuningWants to do it in the futureAlready ran 16 generations (Unsloth + Qwen3)RAG-first, fine-tuning in research phase
Output FormatMarkdown, slides, charts, web UINot emphasizedBlog articles
OpennessPrivatePrivatePublic
InfrastructureLocal filesystemLocal + CLI hooksCloudflare Workers + D1 + Vectorize + KV
Human’s RoleConsumer — doesn’t touch the wikiCurator — sets rules for the system to executeCurator — reviews and publishes publicly
MaturitySelf-described as hacky scriptsHas architecture but not productizedProductized architecture (migrations, rate limiting)

The shared philosophy across all three: knowledge is the LLM’s domain; humans ask questions and curate, they don’t write content manually.

The difference lies in positioning — Karpathy serves personal research, the community model pursues AI’s continuous learning capability, and quidproquo turns output into a public, indexed, infrastructure-backed blog.

Where’s the Product Opportunity

Karpathy put it bluntly: “There is room here for an incredible new product instead of hacky scripts.”

Each of the three current approaches has limitations:

  • Karpathy’s model requires manually managing the raw/ directory and triggering compilation; the barrier is engineering ability
  • The community experience vault model relies on hooks + bridge + fine-tuning with high integration costs and no unified interface between components
  • The quidproquo blog model has productized architecture (schema migrations, rate limiting, audit logging) but is positioned as a blog rather than a general-purpose knowledge base

What’s missing is a product that integrates all three: auto-collection → LLM compilation → semantic indexing → cross-tool sync → decay governance → multi-format output, without requiring users to know CLI or write hooks.

Open-source projects like ContextVault, Mnemex, and Continuous-Claude-v3 each solve a piece of the puzzle, but no one has built the complete product Karpathy described. This may be one of the most important AI tool directions to watch in 2026 — not making AI write more code, but making AI manage your knowledge.

Appendix: Three-Layer Architecture in the Gist and Community Extension Proposals

Karpathy described his architecture in more detail in a GitHub Gist than in his tweets. He breaks the system into three layers:

  1. Raw Sources — immutable original files (articles, papers, datasets)
  2. The Wiki — LLM-generated markdown containing summaries, entity pages, concept pages, and cross-references
  3. The Schema — configuration files defining wiki conventions and workflows (CLAUDE.md or AGENTS.md)

The wiki itself is just a git repo of markdown files, getting version history, branching, and collaboration for free. Two key special files are index.md (a categorized directory helping the LLM navigate efficiently) and log.md (an append-only chronological log using consistent prefixes for parseability).

The discussion thread below the Gist also spawned several advanced proposals worth noting:

  • Provenance tracking — hashing and freshness verification for sources, ensuring knowledge traceability
  • Ontology system — building typed entities and relations, upgrading the wiki from flat files to a structured knowledge graph
  • Decision records — recording why knowledge evolved, not just the current state
  • Multi-LLM strategy — using different LLMs for different security levels of sensitive institutional content
  • Graph database — replacing pure markdown with a graph database for cleaner knowledge organization
  • Reflection layer — a meta-cognitive layer that lets the system analyze its own wiki structure

These proposals echo the “community experience vault model” observations in this post: the community is pushing Karpathy’s original framework toward a more complete knowledge management system.

References