feat(rag): eliminate ChromaDB dependency, load corpus from Supabase Storage

↗ view on GitHub · Davemaina1 · 2026-05-14 · 0b5251e8

Removes both torch/sentence-transformers AND ChromaDB from production.
Corpus (86K chunks, embeddings, metadata) is pre-exported to .npz files
hosted on Supabase Storage (public bucket). On startup, the service
downloads ~50MB, dequantizes int8 embeddings, and builds a BM25 index.
Semantic search is brute-force numpy dot product (~180ms/query for 86K vectors).

Total runtime memory: ~350MB (fits in Render's 512MB free tier).
Zero additional services required.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Repository Davemaina1/iroh_
Author Davemaina1 <dmain7015@gmail.com>
Authored
Parents 7a0da671
Stats 2 files changed , +124 , -143
Part of RAG Python sidecar - memory-engineered for 512MB free tier

Capture this commit into my fork

Download a Markdown prompt that tells Claude how to port this exact commit into your working tree. Run it via claude -p < capture-commit-0b5251e8.md from inside the repo you want the change in.

⬇ Download capture-commit-0b5251e8.md