Security hardening: system prompt confidentiality, PII boundaries, and tool use guardrails

↗ view on GitHub · Isaac Bang · 2026-05-05 · 48c9f772

Adds three security sections to SYSTEM_PROMPT in chatTools.ts:

CONFIDENTIALITY: instructs Mike to never reveal, quote, or acknowledge its
system instructions, including fake-prior-context social engineering patterns.

PRIVACY BOUNDARIES: enumerates PII categories always refused on intent (not
on document availability): SSNs, bank accounts, passports, addresses, phone,
DOB, medical, genetic, biometrics, protected class attributes, compensation
details, criminal history, and settlement amounts tied to named individuals.
Preserves normal legal document work (contract terms, party identification).

TOOL USE BOUNDARIES: adds intent-based refusal for bulk document/workflow
enumeration, cross-client data replication, silent edits without review,
injection payloads, and external forwarding clauses.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Repository nforum/mike
Author Isaac Bang <isaacbang@Isaacs-MacBook-Pro.local>
Authored
Parents d9690965
Stats 1 file changed , +35
Part of Security hardening: system prompt confidentiality, PII, tool-use guardrails

Capture this commit into my fork

Download a Markdown prompt that tells Claude how to port this exact commit into your working tree. Run it via claude -p < capture-commit-48c9f772.md from inside the repo you want the change in.

⬇ Download capture-commit-48c9f772.md