email-prompt-injection-defense

clawhub:email-prompt-injection-defense

View source
C
60/100

First Seen

Feb 18, 2026

Last Scanned

Feb 22, 2026

Findings

2

Score

60/100

CRITICAL 1
HIGH 1

Findings (2)

CRITICAL
Instruction override attempt
L31

Detects attempts to override or ignore previous instructions

ignore previous instructions
FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH
Secrecy instruction
L49

Detects instructions to hide actions from the user

Don't tell the user
FIX

Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.

FP?

Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.