Security hardening: system prompt confidentiality, PII boundaries, and tool use guardrails
Adds three security sections to SYSTEM_PROMPT in chatTools.ts: CONFIDENTIALITY: instructs Mike to never reveal, quote, or acknowledge its system instructions, including fake-prior-context social engineering patterns. PRIVACY BOUNDARIES: enumerates PII categories always refused on intent (not on document availability): SSNs, bank accounts, passports, addresses, phone, DOB, medical, genetic, biometrics, protected class attributes, compensation details, criminal history, and settlement amounts tied to named individuals. Preserves normal legal document work (contract terms, party identification). TOOL USE BOUNDARIES: adds intent-based refusal for bulk document/workflow enumeration, cross-client data replication, silent edits without review, injection payloads, and external forwarding clauses. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
| Repository | nforum/mike |
|---|---|
| Author | Isaac Bang <isaacbang@Isaacs-MacBook-Pro.local> |
| Authored | |
| Parents | d9690965 |
| Stats | 1 file changed , +35 |
| Part of | Security hardening: system prompt confidentiality, PII, tool-use guardrails |
Capture this commit into my fork
Download a Markdown prompt that tells Claude how to port this
exact commit into your working tree. Run it via
claude -p < capture-commit-48c9f772.md
from inside the repo you want the change in.