First Seen
Feb 18, 2026
Last Scanned
Feb 22, 2026
Findings
2
Score
60/100
Findings (2)
Detects attempts to override or ignore previous instructions
ignore previous instructions Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.
Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.
Detects instructions to hide actions from the user
Don't tell the user Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.
Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.