Karpathy's LLM Wiki Points in the Right Direction — But It's Still Stuck on Your Laptop
Karpathy's gist turned 'LLM-compiled knowledge base' into a hot category, and the direction is correct. But his implementation hits three walls: scale, local lock-in, human-driven ingestion. KnowMine has already productized this pattern as cloud-native, MCP-first, and all three are solved.
Why This Wave Matters
Andrej Karpathy's LLM Wiki gist went around the AI community this week, arguing that LLMs should compile and maintain a structured knowledge base instead of starting from scratch on every conversation. The point lands squarely on the biggest missing piece in how most people use AI today.
KnowMine has been building exactly this. It's been live for months: 11 MCP tools, pgvector semantic search, a SOUL user-profile layer. When I read the gist, my first reaction wasn't "good idea." It was "we already shipped this, and the three things his approach can't solve are already solved in ours."
This isn't a critique post. It's a thank-you note, plus a technical comparison of what shifts when you take the same insight and build it as a cloud-native, MCP-first product instead of a local gist.
Three Walls Karpathy's Approach Runs Into
First wall: scale. Karpathy notes that at around 100 sources and a few hundred pages, an index.md is enough and you don't need a vector store. At that scale he's right. But knowledge bases grow. Past 500 entries with cross-domain concepts, keyword matching and index scans start failing. You need semantic similarity. KnowMine was pgvector + text-embedding-3-small on day one. Every entry vectorizes on save; related content surfaces automatically.
Second wall: local lock-in. Karpathy's workflow needs Claude Code or Codex on your laptop, plus local Obsidian. Your knowledge base ends up bound to one machine and one session. KnowMine exposes the whole knowledge base as a remote MCP endpoint. Any MCP-compatible agent, from Claude Code to OpenCode to a custom one, reads and writes the same library from anywhere.
Third wall: human-driven. In Karpathy's flow you explicitly "ingest" sources and "query" the wiki. The LLM only moves when prompted. KnowMine's MCP tools let an AI agent proactively save knowledge mid-conversation. We call this "conversational knowledge capture": you're talking through a problem with Claude, and it stores the insight for you without breaking the thread.
Architecture Side by Side
Same philosophy, different implementation
| Layer | Karpathy | KnowMine |
|---|---|---|
| Raw sources | Local raw/ directory | Any content via the MCP add_knowledge tool: text, URLs, voice transcripts |
| Knowledge base | Markdown files written by the LLM | pgvector-indexed entries with auto-tags, auto-folders, auto-embeddings |
| Rules / schema | CLAUDE.md / AGENTS.md | SOUL profile (AI-written user model) + folder presets |
What it feels like through MCP
When an AI agent saves knowledge into KnowMine mid-conversation:
Tool: add_knowledge
Input: {
"content": "Karpathy's LLM Wiki pattern validates...",
"type": "insight",
"tags": ["competitive-analysis", "knowledge-management"]
}
No filesystem. No local agent. No configuration. The entry is vectorized, tagged, and searchable instantly.
11 MCP tools, not just CRUD
add_knowledge/update_knowledge/delete_knowledge: basic lifecyclesearch_my_knowledge: semantic vector search across your whole libraryget_related_knowledge: surface hidden connections between entriessave_memory/recall_memory: persistent AI memory across sessionsget_soul: pull the AI-generated user profileget_insight: AI-driven analysis of your knowledge patternslist_folders: browse how your knowledge is organized
What We're Borrowing Back
A few ideas from the gist land directly on our roadmap:
The lint pass. Periodic AI health checks over the whole knowledge base: find contradictions, orphan entries, stale claims, missing connections. KnowMine's get_insight is an early version. A full "knowledge lint" is queued up.
Good answers become new pages. This feedback loop, where a single exploration compounds into a permanent entry, is already what save_memory and add_knowledge do in conversation. Karpathy's framing as "knowledge compound interest" is the sharpest name anyone has given it.
User-tunable schema. Letting users define how their knowledge is organized is a strong design choice. Lines up cleanly with our planned folder presets.
Who Should Use What
Karpathy's approach fits you if:
- You're a developer comfortable with Claude Code or Codex
- You want data locked to a local machine
- You like Obsidian's graph view and plugins
- Your knowledge base is topic-specific, roughly 100 sources
KnowMine fits you if:
- You want AI agents to read and write your knowledge base remotely
- You need semantic search across a growing library
- You want zero setup: no local agent, no file management
- You use multiple AI tools and want a shared knowledge layer
- You want the AI to proactively capture knowledge during conversation
You can run both. Nothing stops you from using KnowMine as the compiled knowledge layer while keeping raw sources in local Obsidian. MCP is an interoperability protocol; that's the point of it.
5 Minutes to Hook It Up
- Grab your MCP key at knowmine.ai
- Add the MCP endpoint to your agent config:
knowmine.ai/api/mcp?key=YOUR_KEY - Tell your agent: "Save this insight to my knowledge base." It just works.
- Search semantically: "What do I know about knowledge management?" Related entries surface by meaning, not keywords.
Memex, Delivered
Karpathy quoted Vannevar Bush's 1945 Memex vision: a personal knowledge store where the links between documents matter as much as the documents themselves. Bush never answered who would maintain it. Karpathy says the LLM can. I agree, and the next step is obvious: make that maintained knowledge base reachable through a standard protocol, from anywhere, to every AI agent you use.
Your knowledge shouldn't be trapped in one tool or one machine. It should be a service any AI can call.
Start building your AI-native knowledge base
Free to start. Connect to Claude, ChatGPT, and more.
Get Started Free