Skip to main content

Indexing and Logging

Two special files help the LLM (and you) navigate the wiki as it grows. They serve different purposes.

index.md — content-oriented catalog

index.md is a catalog of everything in the wiki — each page listed with a link, a one-line summary, and optional metadata like date or source count. Organized by category (entities, concepts, sources, etc.).

The LLM updates it on every ingest. When answering a query, the LLM reads the index first to find relevant pages, then drills into them.

Example structure:

# Wiki Index

## Sources
- [OpenAI o3 announcement](sources/openai-o3.md) — Dec 2024 reasoning model, benchmark results, pricing
- [Karpathy LLM OS talk](sources/karpathy-llm-os.md) — LLMs as OS kernel analogy, tool use patterns

## Entities
- [Andrej Karpathy](entities/karpathy.md) — ex-OpenAI/Tesla, educator, micrograd/nanoGPT author
- [OpenAI](entities/openai.md) — lab, products, key papers

## Concepts
- [Chain-of-thought prompting](concepts/chain-of-thought.md) — step-by-step reasoning elicitation
- [RLHF](concepts/rlhf.md) — reinforcement learning from human feedback

## Synthesis
- [Scaling laws overview](synthesis/scaling-laws.md) — cross-source synthesis, updated 2026-03

This approach works surprisingly well at moderate scale (~100 sources, ~hundreds of pages) and avoids the need for embedding-based RAG infrastructure.

log.md — chronological record

log.md is an append-only record of what happened and when — ingests, queries, lint passes.

Format tip: start each entry with a consistent prefix so the log is parseable with simple Unix tools:

## [2026-04-19] ingest | Karpathy llm-wiki.md
Added summary page. Updated concepts/rag.md, concepts/personal-knowledge-management.md.
Created new page: concepts/llm-wiki-pattern.md. 3 new cross-references added.

## [2026-04-18] query | "How does chain-of-thought compare to process reward models?"
Filed answer as synthesis/cot-vs-prm.md.

## [2026-04-15] lint
Found 2 contradictions (flagged inline), 1 orphan page (concepts/sparse-attention.md),
suggested 3 new sources to investigate.

With this prefix, the log is trivially queryable:

# Last 5 entries
grep "^## \[" log.md | tail -5

# All ingests
grep "^## \[.*\] ingest" log.md

# All lint passes
grep "^## \[.*\] lint" log.md

The log gives you a timeline of the wiki's evolution and helps the LLM understand what's been done recently when starting a new session.