Chapter · AI

Retrieval-Augmented Generation

Letting a model answer with knowledge it wasn't trained on. Embeddings, vector search, the chunking choices that decide whether the right passage shows up, and the pipelines that wire it all together.

Topics

Topic 1

Embeddings

Compressing meaning into vectors so similar things land near each other in space.

Planned

Topic 2

Vector Databases

Storing and searching billions of embeddings in milliseconds.

Planned

Topic 3

Chunking Strategies

Splitting documents so retrieval surfaces the right unit of meaning — not too narrow, not too wide.

Planned

Topic 4

Retrieval Methods

Dense, sparse, hybrid — when to reach for each, and what each can and can't find.

Planned

Topic 5

Reranking

A second pass that pays attention to the query itself, not just the retrieved set.

Planned

Topic 6

Hybrid Search

Combining lexical and semantic retrieval to cover what neither catches alone.

Planned

Topic 7

End-to-End RAG Pipelines

Stitching it all together — ingestion, retrieval, generation, evaluation — into a production system.

Planned