Project Overview
GPT Document Intelligence
Upload entire PDF libraries and query them with natural language. Built on LangChain, OpenAI embeddings, and Pinecone. Features hybrid BM25 + dense retrieval, citation tracking, and a streaming chat UI. Handles 10k-page corpora with sub-200ms retrieval.
Python, LangChain, OpenAI, Pinecone, FastAPI
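The overview mentions hybrid BM25 + dense retrieval but does not say how the two result lists are combined. The sketch below shows one common option, reciprocal rank fusion (RRF), purely as an illustration. Everything in it is an assumption rather than the project's actual code: the function names (`bm25_scores`, `cosine`, `rrf_fuse`, `hybrid_search`) are hypothetical, BM25 is computed in memory with whitespace tokenization, and tiny hand-written vectors stand in for OpenAI embeddings and a Pinecone index.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score each document against the query with a plain Okapi BM25 formula."""
    tokenized = [d.lower().split() for d in docs]
    n = len(tokenized)
    avgdl = sum(len(t) for t in tokenized) / n
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avgdl)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def rank(scores):
    """Return document indices ordered from highest to lowest score."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


def rrf_fuse(rankings, k=60):
    """Reciprocal rank fusion: sum 1 / (k + position) over all rankings."""
    fused = {}
    for ranking in rankings:
        for pos, doc_id in enumerate(ranking, start=1):
            fused[doc_id] = fused.get(doc_id, 0.0) + 1.0 / (k + pos)
    return sorted(fused, key=fused.get, reverse=True)


def hybrid_search(query, query_vec, docs, doc_vecs):
    """Fuse the sparse (BM25) and dense (cosine) rankings with RRF."""
    sparse = rank(bm25_scores(query, docs))
    dense = rank([cosine(query_vec, v) for v in doc_vecs])
    return rrf_fuse([sparse, dense])


if __name__ == "__main__":
    # Toy corpus and toy vectors; real embeddings would come from a model.
    docs = ["pinecone vector index", "bm25 keyword search", "streaming chat ui"]
    doc_vecs = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    for i in hybrid_search("keyword search", [0.0, 1.0, 0.0], docs, doc_vecs):
        print(docs[i])
```

RRF works on ranks rather than raw scores, so it sidesteps the problem that BM25 scores and cosine similarities live on different scales. A weighted sum of normalized scores would be another reasonable choice.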
Private
GALLERY
PDF library upload & indexing pipeline