First Seen: Feb 18, 2026
Last Scanned: Feb 22, 2026
Findings: 2
Score: 67/100

Findings (2)
Finding 1: Detects common jailbreak prompt patterns (category: bypass_safety)
Remediation: Remove directives that attempt to exfiltrate data through the agent's response (e.g., asking the agent to embed credentials in URLs or include secret values in output).
Likely FP if the text is a legitimate tool instruction about displaying configuration to the user (e.g., "show current settings") without external transmission.
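A minimal sketch of the kind of exfiltration-directive check this finding implies, assuming simple regex heuristics. The pattern list and the `flags_exfiltration` helper are hypothetical illustrations, not the scanner's actual rules:

```python
import re

# Hypothetical patterns for directives that ask an agent to transmit
# secrets in its output (assumption: the scanner uses something similar).
EXFIL_PATTERNS = [
    r"embed\s+(?:the\s+)?(?:credentials?|tokens?|api\s*keys?)\s+in\s+(?:a\s+)?url",
    r"include\s+(?:the\s+)?secret\s+values?\s+in\s+(?:the\s+)?output",
    r"send\s+(?:the\s+)?(?:credentials?|secrets?)\s+to\s+https?://",
]

def flags_exfiltration(text: str) -> bool:
    """Return True if any exfiltration-style directive pattern matches."""
    lowered = text.lower()
    return any(re.search(p, lowered) for p in EXFIL_PATTERNS)
```

A check like this would flag "embed the credentials in a URL" while leaving a benign "show current settings to the user" instruction alone, which matches the FP guidance above.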
Finding 2: Benign heading followed by dangerous content (category: credential_access)
Flagged heading: "'@openai/agents:*'\n;\n// Verbose logging\n..."
Remediation: Never paste tokens, API keys, cookies, environment variables, or config files containing secrets. Ensure section headings accurately reflect the content that follows. Remove headings that could mislead an LLM into treating content differently than intended.
Likely FP if the heading mismatch is due to inconsistent markdown formatting or a benign section title that happens to contain keywords like "system" or "config".
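The heading/content mismatch heuristic behind this finding could be sketched as below. The `suspicious_sections` helper and both pattern sets are hypothetical illustrations, not the scanner's implementation: they flag markdown headings that look like spliced code fragments when the body that follows talks about credentials.

```python
import re

# Hypothetical heuristics (assumptions, not the scanner's real rules):
# code debris in a heading suggests the title was spliced from elsewhere;
# credential vocabulary in the body suggests credential-access content.
CODE_FRAGMENT = re.compile(r"[;{}]|//|\\n|@\w+/")
CREDENTIAL_TERMS = re.compile(
    r"\b(token|api key|cookie|environment variable|secret)s?\b", re.I
)

def suspicious_sections(markdown: str) -> list[str]:
    """Return headings that look like code debris and precede credential talk."""
    flagged = []
    # Split on ATX headings, capturing each heading's text.
    parts = re.split(r"^#+\s*(.*)$", markdown, flags=re.M)
    # parts = [preamble, heading1, body1, heading2, body2, ...]
    for heading, body in zip(parts[1::2], parts[2::2]):
        if CODE_FRAGMENT.search(heading) and CREDENTIAL_TERMS.search(body):
            flagged.append(heading)
    return flagged
```

Note the FP modes called out above survive this sketch: a plain heading containing "system" or "config" carries no code fragment, so it would not be flagged.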