Project Overview

High-Traffic Email RAG Engine

Production RAG pipeline ingesting millions of emails and attachments (PDF, DOCX, XLSX) daily. Hybrid retrieval combines BM25 sparse search via OpenSearch with dense vector search via Qdrant, fused through Reciprocal Rank Fusion. Includes chunking with overlap, per-tenant index isolation, async ingestion queues (Celery/Redis), deduplication, embedding caching, query rewriting, re-ranking with a cross-encoder, and streaming LLM responses — sustaining 10k+ queries/day with p99 latency under 300ms.

PythonOpenSearchQdrantCeleryRedis
PrivateBack to Projects
GALLERY
RRF-fused results with re-ranking
Email & attachment ingestion pipeline

Email & attachment ingestion pipeline

Hybrid sparse + dense search interface