Document Search & Summarization - Multi-Page Guide

This documentation is organized into focused topics for easier navigation.

Table of Contents

Core Documentation

  1. Overview - Introduction, quick start, and learning paths
  2. Architecture - 80/20 reuse strategy, orchestration patterns
  3. Storage & Indexing - S3 patterns, indexing pipelines, theory
  4. Full Guide - Complete educational guide with IR theory, deployment

Quick References

Code & Examples

Reading Paths

Path 1: Quick Start (80 minutes)

For a first look: skim the essentials and run the demo.

  1. Overview - 10 min
  2. Quick Reference - 15 min
  3. Run demo - 20 min
  4. Architecture basics - 35 min

Path 2: Implementation (2 hours)

For developers ready to build.

  1. Quick Reference - 10 min
  2. Architecture - 30 min
  3. Storage & Indexing - 40 min
  4. Full Guide - 40 min

Path 3: Deep Understanding (3 hours)

For ML engineers and researchers.

  1. Full Guide - Theory - 60 min
  2. Architecture Deep Dive - 40 min
  3. Storage & Indexing Theory - 40 min
  4. Full Guide - Evaluation - 40 min

Key Features Summary

Hybrid Retrieval

  • BM25 keyword matching + Vector semantic search
  • Reciprocal Rank Fusion (RRF)
  • Query expansion (PRF, HyDE)
  • α-weighted combination
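
The two fusion options above can be sketched in a few lines. This is a minimal illustration, not the guide's implementation: the function names, the min-max normalization step, the convention that α weights the vector side, and the constant k=60 (a commonly used RRF default) are all assumptions.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists of doc IDs: score(d) = sum of 1 / (k + rank of d)."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by doc ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def _min_max(scores):
    """Rescale raw scores to [0, 1] so BM25 and vector scores are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    span = hi - lo
    # If every score is identical, treat them all as equally (fully) relevant.
    return {d: (s - lo) / span if span else 1.0 for d, s in scores.items()}


def alpha_weighted(bm25_scores, vector_scores, alpha=0.5):
    """Assumed convention: alpha * vector + (1 - alpha) * BM25, after scaling."""
    b, v = _min_max(bm25_scores), _min_max(vector_scores)
    return {
        d: alpha * v.get(d, 0.0) + (1 - alpha) * b.get(d, 0.0)
        for d in set(b) | set(v)
    }


if __name__ == "__main__":
    bm25_ranking = ["a", "b", "c"]
    vector_ranking = ["b", "c", "a"]
    print(reciprocal_rank_fusion([bm25_ranking, vector_ranking]))
    print(alpha_weighted({"a": 2.0, "b": 1.0}, {"a": 0.1, "b": 0.9}, alpha=0.7))
```

RRF needs only rank positions, so it avoids score normalization entirely. The α-weighted form needs comparable score scales, which is why the sketch rescales each retriever's scores first.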

Grounded Summarization

  • Extractive (TextRank) - fast, faithful
  • Abstractive (LLM) - fluent, comprehensive
  • Sentence-level citations
  • Faithfulness verification
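
As a rough sketch of the extractive path with sentence-level citations, the following ranks sentences with a small TextRank-style iteration and tags each selected sentence with its source position. The regex sentence splitter, the word-overlap similarity, the `doc_id#sN` citation format, and all function names are illustrative assumptions, not the guide's actual components.

```python
import math
import re


def split_sentences(text):
    """Naive splitter: break after ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def similarity(a, b):
    """Word overlap normalized by log sentence lengths (TextRank-style)."""
    wa = set(re.findall(r"\w+", a.lower()))
    wb = set(re.findall(r"\w+", b.lower()))
    if not wa or not wb:
        return 0.0
    denom = math.log(len(wa)) + math.log(len(wb))
    return len(wa & wb) / denom if denom > 0 else 0.0


def textrank(sentences, damping=0.85, iterations=50):
    """Weighted PageRank over the sentence-similarity graph."""
    n = len(sentences)
    if n == 0:
        return []
    weights = [
        [similarity(sentences[i], sentences[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    out_sum = [sum(row) for row in weights]
    scores = [1.0 / n] * n
    for _ in range(iterations):
        scores = [
            (1 - damping) / n
            + damping * sum(
                weights[j][i] / out_sum[j] * scores[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            for i in range(n)
        ]
    return scores


def summarize(text, doc_id, top_k=2):
    """Return the top_k sentences in document order, each with a citation."""
    sentences = split_sentences(text)
    scores = textrank(sentences)
    top = sorted(range(len(sentences)), key=lambda i: -scores[i])[:top_k]
    # The citation format "doc_id#sN" is hypothetical.
    return [{"text": sentences[i], "citation": f"{doc_id}#s{i}"} for i in sorted(top)]


if __name__ == "__main__":
    sample = (
        "Hybrid retrieval combines BM25 and vector search. "
        "Vector search and BM25 complement each other in hybrid retrieval. "
        "The weather was pleasant."
    )
    for item in summarize(sample, "doc1"):
        print(item["citation"], item["text"])
```

Because every output sentence is copied verbatim from the source, a citation can always be traced back to an exact sentence. An abstractive summary needs a separate check to reach the same guarantee.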

Profile-Based Architecture

  • Balanced: 500 ms latency, good quality, $0.60/1K
  • Latency-First: 250 ms latency, acceptable quality, $0.35/1K
  • Quality-First: 5 s latency, excellent quality, $52/1K
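
One way to picture the profiles is as plain configuration records plus a selection rule. The sketch below is hypothetical: the `Profile` fields, the `pick_profile` helper, and the rule "take the highest-quality profile that fits the latency budget" are assumptions. The figures are the ones listed above, and the "per 1K" unit is carried over as given, since it is not specified here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Profile:
    name: str
    latency_ms: int         # latency figure from the list above
    quality: str            # qualitative label from the list above
    cost_per_1k_usd: float  # "$ per 1K" as listed; the unit is unspecified


# Ordered from fastest to slowest; quality rises with latency in the listing.
PROFILES = [
    Profile("latency-first", 250, "acceptable", 0.35),
    Profile("balanced", 500, "good", 0.60),
    Profile("quality-first", 5000, "excellent", 52.00),
]


def pick_profile(latency_budget_ms):
    """Return the slowest profile that still fits within the latency budget."""
    eligible = [p for p in PROFILES if p.latency_ms <= latency_budget_ms]
    if not eligible:
        raise ValueError(f"no profile fits a {latency_budget_ms} ms budget")
    return max(eligible, key=lambda p: p.latency_ms)


if __name__ == "__main__":
    for budget in (300, 1000, 10000):
        print(budget, pick_profile(budget).name)
```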

Implementation Status

Week 0 Complete - Foundation

  • Core components implemented
  • Profile architecture validated
  • Test fixtures created
  • Demo ready

🔜 Week 1 In Progress - Document loading

  • PDF, DOCX, XLSX loaders
  • S3 integration
  • Baseline evaluation

Support