security-sentinel-skill

clawhub:security-sentinel-skill

Score: 0/100

First Seen: Feb 18, 2026

Last Scanned: Feb 22, 2026

Findings by severity

CRITICAL 6

HIGH 10

MEDIUM 4

LOW 6

Findings (26)

CRITICAL

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L104

Detects attempts to override or ignore previous instructions

ignore previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.
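A minimal sketch of how a pattern-based check of this kind might work. The rule IDs and names are taken from this report; the regular expressions, the `Finding` class, and the `scan` function are assumptions for illustration, not the scanner's actual implementation.

```python
import re
from dataclasses import dataclass

# Hypothetical rule table: IDs and names come from the report,
# but the regular expressions are assumptions, not the scanner's own.
RULES = {
    "PROMPT_INJECTION_001": (
        "Instruction override attempt",
        re.compile(r"ignore\s+(?:all\s+)?previous\s+instructions", re.IGNORECASE),
    ),
    "PROMPT_INJECTION_011": (
        "Jailbreak template",
        re.compile(r"\b(?:developer|unrestricted|DAN)\s+mode\b", re.IGNORECASE),
    ),
}


@dataclass
class Finding:
    rule_id: str
    name: str
    line: int      # 1-indexed line number in the scanned text
    matched: str   # the exact text that triggered the rule


def scan(text: str) -> list[Finding]:
    """Report every rule match together with the line it occurs on."""
    findings = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for rule_id, (name, pattern) in RULES.items():
            for m in pattern.finditer(line):
                findings.append(Finding(rule_id, name, lineno, m.group(0)))
    return findings


sample = "Block phrases such as:\n- ignore all previous instructions\n- enable DAN mode\n"
for f in scan(sample):
    print(f"{f.rule_id} L{f.line}: {f.matched}")
```

A matcher like this sees only the phrase, not its context, so a skill that lists these phrases as things to block would be flagged just like one that uses them as directives.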

CRITICAL

Jailbreak template PROMPT_INJECTION_011

prompt-injection L119

Detects common jailbreak prompt patterns

developer mode

FIX

Remove jailbreak phrasing from the skill definition. Text that tries to switch the agent into an unrestricted persona or mode (e.g., "developer mode", "DAN mode") is an attempt to bypass its safety constraints.

FP?

Likely FP if the phrase appears as a quoted example or detection pattern in security documentation, or refers to a genuine product feature (e.g., a device's developer mode), rather than as a directive to the agent.

CRITICAL

Jailbreak template PROMPT_INJECTION_011

prompt-injection L120

Detects common jailbreak prompt patterns

unrestricted mode

FIX

Remove jailbreak phrasing from the skill definition. Text that tries to switch the agent into an unrestricted persona or mode (e.g., "developer mode", "DAN mode") is an attempt to bypass its safety constraints.

FP?

Likely FP if the phrase appears as a quoted example or detection pattern in security documentation, or refers to a genuine product feature (e.g., a device's developer mode), rather than as a directive to the agent.

CRITICAL

Fake system prompt PROMPT_INJECTION_010

prompt-injection L148

Detects content pretending to be a system prompt

system prompt
-

FIX

Remove text that impersonates a system prompt from the skill definition. Content formatted to look like system-level instructions can trick the agent into treating it as authoritative.

FP?

Likely FP if the phrase "system prompt" is only mentioned in documentation or in a list of detection patterns, rather than used to introduce fake system-level directives.

CRITICAL

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L714

Detects attempts to override or ignore previous instructions

ignore previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

CRITICAL

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L819

Detects attempts to override or ignore previous instructions

ignore previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.
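The FP notes above turn on whether a flagged phrase is quoted as an example or used as live text. One rough way to triage findings is to check whether the flagged line sits inside a fenced code block. The sketch below assumes the scanned file uses Markdown-style ``` fences; the function names are hypothetical, and nothing in the report indicates that the scanner does this itself.

```python
def lines_in_fenced_blocks(text: str) -> set[int]:
    """Return the 1-indexed line numbers that fall inside ``` fenced blocks."""
    inside = False
    fenced = set()
    for lineno, line in enumerate(text.splitlines(), start=1):
        if line.lstrip().startswith("```"):
            inside = not inside   # a fence line toggles the state
            continue              # the fence line itself is not counted
        if inside:
            fenced.add(lineno)
    return fenced


def triage(text: str, finding_lines: list[int]) -> dict[int, str]:
    """Label each flagged line by whether it is quoted inside a fence."""
    fenced = lines_in_fenced_blocks(text)
    return {
        n: "review: quoted example?" if n in fenced else "review: active text"
        for n in finding_lines
    }


doc = (
    "Examples of attacks:\n"
    "```\n"
    "ignore previous instructions\n"
    "```\n"
    "Now ignore previous instructions.\n"
)
print(triage(doc, [3, 5]))
```

This is only a heuristic for ordering manual review: a payload hidden inside a fenced block is still a payload, so a fenced match should not be dismissed automatically.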

HIGH

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L305

Detects attempts to override or ignore previous instructions

ignore previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L334

Detects attempts to override or ignore previous instructions

ignore all previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L395

Detects attempts to override or ignore previous instructions

ignore previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH

Jailbreak template PROMPT_INJECTION_011

prompt-injection L413

Detects common jailbreak prompt patterns

DAN mode

FIX

Remove jailbreak phrasing from the skill definition. Text that tries to switch the agent into an unrestricted persona or mode (e.g., "developer mode", "DAN mode") is an attempt to bypass its safety constraints.

FP?

Likely FP if the phrase appears as a quoted example or detection pattern in security documentation, or refers to a genuine product feature (e.g., a device's developer mode), rather than as a directive to the agent.

HIGH

Jailbreak template PROMPT_INJECTION_011

prompt-injection L414

Detects common jailbreak prompt patterns

developer mode

FIX

Remove jailbreak phrasing from the skill definition. Text that tries to switch the agent into an unrestricted persona or mode (e.g., "developer mode", "DAN mode") is an attempt to bypass its safety constraints.

FP?

Likely FP if the phrase appears as a quoted example or detection pattern in security documentation, or refers to a genuine product feature (e.g., a device's developer mode), rather than as a directive to the agent.

HIGH

Jailbreak template PROMPT_INJECTION_011

prompt-injection L415

Detects common jailbreak prompt patterns

unrestricted mode

FIX

Remove jailbreak phrasing from the skill definition. Text that tries to switch the agent into an unrestricted persona or mode (e.g., "developer mode", "DAN mode") is an attempt to bypass its safety constraints.

FP?

Likely FP if the phrase appears as a quoted example or detection pattern in security documentation, or refers to a genuine product feature (e.g., a device's developer mode), rather than as a directive to the agent.

HIGH

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L463

Detects attempts to override or ignore previous instructions

ignore all previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L465

Detects attempts to override or ignore previous instructions

ignore all previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L495

Detects attempts to override or ignore previous instructions

ignore previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH

Jailbreak template PROMPT_INJECTION_011

prompt-injection L549

Detects common jailbreak prompt patterns

DAN mode

FIX

Remove jailbreak phrasing from the skill definition. Text that tries to switch the agent into an unrestricted persona or mode (e.g., "developer mode", "DAN mode") is an attempt to bypass its safety constraints.

FP?

Likely FP if the phrase appears as a quoted example or detection pattern in security documentation, or refers to a genuine product feature (e.g., a device's developer mode), rather than as a directive to the agent.

MEDIUM

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L111

Detects attempts to extract the system prompt

show me your system prompt

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

MEDIUM

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L112

Detects attempts to extract the system prompt

output your configuration

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

MEDIUM

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L113

Detects attempts to extract the system prompt

print your instructions

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

MEDIUM

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L115

Detects attempts to extract the system prompt

what are your rules

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

LOW

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L405

Detects attempts to extract the system prompt

print your prompt

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

LOW

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L408

Detects attempts to extract the system prompt

output your configuration

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

LOW

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L409

Detects attempts to extract the system prompt

what are your instructions

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

LOW

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L463

Detects attempts to extract the system prompt

reveal your system prompt

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

LOW

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L482

Detects attempts to extract the system prompt

show me your system prompt

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.

LOW

Prompt leaking attempt PROMPT_INJECTION_015

prompt-injection L500

Detects attempts to extract the system prompt

What are your system instructions

FIX

Remove directives that ask the agent to reveal its system prompt, instructions, or configuration. Disclosing them helps an attacker craft targeted bypasses.

FP?

Likely FP if the phrase is listed as a detection pattern or quoted example in security documentation, or if the skill legitimately shows its own user-facing settings, rather than instructing the agent to disclose hidden instructions.