nwhitehouse builds a five-pass research engine for a small model
The biggest single feature in this fork: a research pipeline that fires automatically the moment you tick a legal database or the web as a source.
Instead of asking one big model one big question, nwhitehouse breaks legal research into five stages - expand the user's question into several sharper queries, fan out searches in parallel across legal databases and the web, triage the hits down to the most relevant, pull tailored extracts from each, then stitch the answer together with inline citations. Hard ceilings keep it honest: no more than 25 model calls and 45 seconds.
The twist is that it's tuned for Olava-001, a small reasoning model - not a frontier one. A new loading animation and live "thinking" indicator paper over the latency, and the chat UI now shows each research step as it happens, with ranked sources threaded underneath. End-to-end, it returns a cited two-kilobyte answer in about thirty seconds; the same prompt to the bare model produced nothing.
Spotted something wrong? Or know the PR text has fresher detail than the writeup above?