1sbang hardens Mike against the questions it shouldn't answer
A prompt-only rewrite that teaches the legal assistant to refuse leaking its own setup, handing over personal data, or being talked into bulk data grabs.
1sbang's change doesn't touch a single tool or feature - it rewrites the standing instructions Mike runs on. Three new guardrails go in. The assistant now refuses to repeat or paraphrase its own configuration, even when asked with a "just continue where you left off" trick that pretends it already started spilling. It refuses to pull personal data - social security numbers, medical records, criminal history, settlement figures tied to a named person - based on what you're asking for, not on whether a document happens to be open. And it pushes back on sweeping every document at once, copying files between client matters, and making silent edits without showing you the changes first.
The careful part: ordinary work survives untouched. Payment terms, the parties to a contract, business addresses all still come back. The change went through automated red-team testing, and each fix was kept only if it blocked attacks without breaking a legitimate request.
Spotted something wrong? Or know the PR text has fresher detail than the writeup above?