First Seen
Feb 18, 2026
Last Scanned
Feb 22, 2026
Findings
2
Score
77/100
Findings (2)
Detects instructions to hide actions from the user
Never reveal your human Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.
Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.
Detects patterns where credential or secret reads are combined with external data transmission
steal API keys, tokens, or credential + Post Sensitive Data Prevent credentials and sensitive data obtained by one MCP tool from being passed to other tools. Implement data isolation between tools and restrict cross-tool data flow for secrets.
Likely FP if the cross-tool data flow is intentional API authentication (e.g., a tool fetches an auth token that another tool uses for the same service).