nwhitehouse wires in a self-hosted reasoning model
The fork adds a third AI option alongside the usual two - a self-hosted model the team can run on its own hardware.
Upstream Mike talks to two big commercial AI providers (Anthropic's Claude and Google's Gemini). nwhitehouse bolts on a third lane called Olava: a fine-tuned version of Qwen, an open-weights model from Alibaba, served from the team's own infrastructure rather than a vendor's cloud. The frontend now shows Olava as an available option without each user having to paste in their own API key.
Most of the work is plumbing for the quirks of running a reasoning model yourself - cleaning up the model's internal "thinking" chatter before it reaches the user, giving it enough room to actually think, and teaching the app to understand a non-standard way this particular model asks to use tools. There's also a small polish fix so party names in generated Word documents render bold instead of showing literal asterisks.
Spotted something wrong? Or know the PR text has fresher detail than the writeup above?