openclaw-safety-coach

clawhub:openclaw-safety-coach

View source
C
67/100

First Seen

Feb 18, 2026

Last Scanned

Feb 22, 2026

Findings

2

Score

67/100

CRITICAL 1
MEDIUM 1

Findings (2)

CRITICAL
Jailbreak template
L38

Detects common jailbreak prompt patterns

bypass safety
FIX

Remove directives that attempt to exfiltrate data through the agent's response (e.g., asking the agent to embed credentials in URLs or include secret values in output).

FP?

Likely FP if the text is a legitimate tool instruction about displaying configuration to the user (e.g., show current settings) without external transmission.

MEDIUM
Benign heading "'@openai/agents:*'\n;\n// Verbose logging\n..." followed by dangerous content (category: credential_access)
L1

Benign heading "'@openai/agents:*'\n;\n// Verbose logging\n..." followed by dangerous content (category: credential_access)

Never paste tokens, API keys, cookies, environment variables, or config files containing secrets.
FIX

Ensure section headings accurately reflect the content that follows. Remove headings that could mislead an LLM into treating content differently than intended.

FP?

Likely FP if the heading mismatch is due to inconsistent markdown formatting or a benign section title that happens to contain keywords like system or config.