Table of Contents

Six parts, twenty chapters, three appendices. Parts II, III, V, and VI are designed to stand on their own, so readers can enter at the layer of the stack that matters most to them. Each chapter below shows a short summary and a status indicator so you can tell what is ready to read.

Part I

Why Hybrid Search

After Part I, you'll understand exactly where keyword and vector search fail and have a decision framework for when hybrid retrieval is worth the complexity.

Readers evaluating whether hybrid search is worth the investment, or building the case for a migration, should start here. Readers already convinced that hybrid search is the right approach can skip to Part II.

Chapter 1

The Limits of Keyword Search

Get Sample

For three decades, every mainstream open-source search engine has relied on the same core idea: match the terms in a query against an inverted index and rank by a scoring function like BM25. This chapter explains how BM25 works, why that approach has been so durable, and the specific, well-documented ways it fails silently on a substantial fraction of real user queries.

Chapter 2

The Limits of Vector Search

Get Sample

Vector search promises to solve the vocabulary mismatch problem by matching meaning instead of words, and on aggregate benchmarks it has decisively overtaken BM25. In exchange, it introduces a different category of failures (entity confusion, hallucinated similarity, and blindness to negation) whose silent nature often makes them more difficult to surface and correct than the ones BM25 creates.

Chapter 3

The Case for Hybrid

Coming soon

Where each retrieval paradigm breaks down is not arbitrary: lexical and dense methods miss different queries in predictable, measurable ways, and the overlap between their failure sets is small. This chapter develops that observation, walks through the three main fusion approaches (Reciprocal Rank Fusion, weighted linear interpolation, and learned fusion), surveys benchmark evidence for hybrid consistently beating either approach alone, and offers a decision framework for deciding when the added complexity earns its keep.

Appendices

  • Appendix A.Mathematical Foundations Quick Reference
  • Appendix B.Benchmark Datasets for Search Evaluation
  • Appendix C.Migration Playbook