CompletedAugust 2025 - December 2025
SimRAG Reproduction Study
Reproduction study of a paper implementing similarity-based RAG with two-stage fine-tuning on consumer hardware.
PythonRAGQdrantSentence TransformersOllamaPurdue GenAI APIPyTorchDockerPoetry
Challenge
Reproduce and understand the SimRAG paper's similarity-based RAG techniques, implementing on consumer hardware to learn RAG fundamentals.
Solution
•
Built provider-agnostic interface supporting local (Ollama) and cloud LLMs
•
Implemented two-stage fine-tuning: instruction following, then domain adaptation
•
Created test suite with mocked dependencies for reproducible testing
Result
Successfully trained and tested on consumer hardware (RTX 3080). Fine-tuning showed limited improvement because the 1.5B model I used was smaller than the paper's 8B/27B models, highlighting the importance of model capacity in RAG systems.