nwhitehouse keeps the AI typing live during tool calls
When the assistant pauses to look something up, it no longer goes silent the whole time.
nwhitehouse runs Mike on a self-hosted setup where the AI model can pause mid-answer to fetch case law from CourtListener (a free public database of US court opinions). The problem: on turns where the model reached for a tool, the chat window froze - no live typing, no signal that anything was happening, just dead air until the full response landed.
The first fix assumed the streaming text was arriving but mangled; the team shipped it, then discovered by testing against the live endpoint that the underlying server actually drops the tool-call signal entirely when streaming. The second fix is a workaround: if the model goes to use a tool, briefly drop out of live mode to capture the request, then go back to live typing for the actual written answer. Result is roughly two seconds of pause when a lookup fires, and smooth streaming for everything that follows.
Spotted something wrong? Or know the PR text has fresher detail than the writeup above?