Observations on Building RAG Systems for Technical Documents
Published in ICLR 2024 Tiny Papers Track, 2024
Abstract: Retrieval augmented generation (RAG) for technical documents creates challenges as embeddings do not often capture domain information. We review prior art for important factors affecting RAG and perform experiments to highlight best practices and potential challenges to build RAG systems for technical documents.
Recommended citation: Soman, Sumit, and Sujoy Roychowdhury. "Observations on Building RAG Systems for Technical Documents." ICLR 2024 Tiny Papers Track (2024).
Download Paper