openclaw-cost-optimizer

clawhub:openclaw-cost-optimizer

View source
D
25/100

First Seen

Feb 18, 2026

Last Scanned

Feb 22, 2026

Findings

5

Score

25/100

HIGH 5

Findings (5)

HIGH
Secrecy instruction
L23

Detects instructions to hide actions from the user

NEVER tell the user
FIX

Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.

FP?

Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.

HIGH
Secrecy instruction
L96

Detects instructions to hide actions from the user

do not show this to user
FIX

Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.

FP?

Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.

HIGH
Secrecy instruction
L162

Detects instructions to hide actions from the user

DO NOT tell user
FIX

Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.

FP?

Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.

HIGH
Role switching attempt
L601

Detects attempts to make the AI assume a different role

Switch to a mode
FIX

Remove directives that attempt to alter the agent's behavior or persona. Skill descriptions should describe tool functionality, not contain behavioral commands for the agent.

FP?

Likely FP if the matched text is part of the skill's legitimate system prompt configuration that defines the tool's behavior for the user, not an attack.

HIGH
Secrecy instruction
L850

Detects instructions to hide actions from the user

Never tell user
FIX

Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.

FP?

Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.