ai-sentinel

clawhub:ai-sentinel

View source
B
85/100

First Seen

Feb 18, 2026

Last Scanned

Feb 22, 2026

Findings

2

Score

85/100

HIGH 1
LOW 1

Findings (2)

HIGH
Instruction override attempt
L263

Detects attempts to override or ignore previous instructions

Ignore all previous instructions
FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

LOW
Prompt leaking attempt
L263

Detects attempts to extract the system prompt

reveal your system prompt
FIX

Remove directives that attempt to make the agent act autonomously without user confirmation (e.g., automatically running commands, skipping confirmation prompts).

FP?

Likely FP if the matched text is in a CI/CD tool description where automated execution is the intended behavior in a controlled environment.