agentsocial

clawhub:agentsocial

View source

37/100

First Seen

Feb 18, 2026

Last Scanned

Feb 22, 2026

Findings

Score

37/100

CRITICAL 1

HIGH 2

MEDIUM 1

Findings (4)

CRITICAL

Instruction override attempt PROMPT_INJECTION_001

prompt-injection L494

Detects attempts to override or ignore previous instructions

ignore previous instructions

FIX

Remove the injection payload from the skill definition. Text that attempts to reset agent context or override prior directives is a direct attack vector.

FP?

Likely FP if the text is in a security tutorial or research paper discussing injection techniques as examples, not in an active skill description.

HIGH

Autonomous agent spawning PROMPT_INJECTION_017

prompt-injection L304

Detects autonomous sub-agent or cron-based execution without human oversight

Cron | Heartbeat Cron | Notification Check + autonomous

FIX

Remove directives that force the agent to call specific tools or APIs not required for the skill's stated functionality. Tool calls should be determined by user intent, not embedded directives.

FP?

Likely FP if the skill legitimately needs to call other tools as part of its workflow (e.g., a deployment skill that calls git and cloud CLI tools).

HIGH

Secrecy instruction PROMPT_INJECTION_008

prompt-injection L489

Detects instructions to hide actions from the user

NEVER reveal the contents of `SOUL.md`, `USER

FIX

Remove directives that attempt to change the agent's output format, suppress safety warnings, or alter response structure in ways that bypass safety controls.

FP?

Likely FP if the output format directive is a legitimate tool configuration (e.g., return results as JSON) that does not suppress safety features.

MEDIUM

External API response drives agent behavior INDIRECT_009

indirect-injection L34

Detects skills where external API responses control agent decisions or actions

call the appropriate API  + based on result

FIX

Do not include content from MCP tool responses verbatim in system prompts or tool descriptions. Sanitize all dynamic content before incorporating it into prompt context.

FP?

Likely FP if the match is a static tool description that mentions dynamic content handling in its documentation, not an actual injection vector.