brauliogusmao wires Mike up to run on your own hardware
This fork can now talk to local AI models running on your own machine, with no cloud round-trip required.
The team has added support for Ollama, a tool that lets you run open-source AI models locally instead of sending every query out to OpenAI or Anthropic. In the chat interface, a new "Local (Ollama)" group appears in the model picker, populated automatically from whatever models the user has installed.
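For readers who want a feel for how such a picker group could be filled, here is a minimal sketch. It is not the fork's code, which the writeup does not show. It assumes Ollama is listening on its default local address and uses Ollama's `/api/tags` endpoint, which lists locally installed models; the function names `fetch_installed_models` and `build_picker_group` and the shape of the returned group are hypothetical.

```python
import json
from urllib.request import urlopen

# Ollama's default local address; an assumption about the user's setup.
OLLAMA_URL = "http://localhost:11434"


def fetch_installed_models(base_url: str = OLLAMA_URL) -> dict:
    """Ask the local Ollama server which models are installed (GET /api/tags)."""
    with urlopen(f"{base_url}/api/tags", timeout=5) as resp:
        return json.load(resp)


def build_picker_group(tags_response: dict) -> dict:
    """Turn an /api/tags response into a hypothetical model-picker group."""
    # Each entry in "models" carries a "name" such as "llama3.1:8b".
    names = sorted(m["name"] for m in tags_response.get("models", []))
    return {"label": "Local (Ollama)", "models": names}
```

Calling `build_picker_group(fetch_installed_models())` with Ollama running would yield one group whose entries mirror the installed models. Keeping the network call separate from the formatting step makes the formatting easy to check without a running server.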
The more interesting touch is how the fork handles the fact that smaller local models aren't as disciplined as the big cloud ones. When the assistant needs to pull in a document, it's allowed to ask once, and then that capability is quietly taken away for the rest of the conversation, so a chattier local model doesn't keep re-fetching the same files in a loop. It's a small accommodation that says someone actually sat with these models and watched how they behave.
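The ask-once behavior can be pictured as a small gate around the tool list. The sketch below is one way to do it under stated assumptions, not the fork's actual implementation: the `ToolGate` class, the `fetch_document` tool name, and the choice to raise an error on a repeat call are all hypothetical.

```python
# Hypothetical name for the document-fetching tool.
FETCH_TOOL = "fetch_document"


class ToolGate:
    """Tracks which tools a model may still call in one conversation."""

    def __init__(self, tools, one_shot=(FETCH_TOOL,)):
        self._tools = list(tools)
        self._one_shot = set(one_shot)  # tools that may be used only once
        self._spent = set()             # one-shot tools already used

    def available(self):
        """Tools to advertise to the model on the next turn."""
        return [t for t in self._tools if t not in self._spent]

    def record_call(self, name):
        """Register a tool call; a one-shot tool is withdrawn afterwards."""
        if name not in self.available():
            raise PermissionError(f"tool {name!r} is not available")
        if name in self._one_shot:
            self._spent.add(name)
```

Because the spent tool no longer appears in `available()`, it is simply not offered to the model on later turns, so the model has nothing to call in a loop rather than being told off after the fact.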