Credential Leak

Skills that expose API keys, tokens, passwords, or private keys in their source code or descriptions

12 detection rules · 61 skills affected

What is credential leak?

Credential leaks happen when a skill's code or configuration contains hardcoded secrets: API keys, database passwords, private keys, OAuth tokens, webhook secrets. Sometimes these are the skill author's own credentials, accidentally committed. Other times, the skill is designed to trick users into providing their credentials in a way that exposes them.

The patterns are well known in traditional software: a .env file checked into version control, a private key pasted into a config file, an API token hardcoded in a connection string. In the AI skill ecosystem, the same mistakes happen, but with a twist. Skill descriptions and tool definitions are public by design; they are meant to be read by AI agents. So a credential sitting in a skill's metadata is immediately accessible to every agent that loads it.

Aguara detects twelve distinct credential patterns, from AWS access keys and GitHub tokens to PEM-encoded private keys and generic high-entropy strings that look like secrets. The scanner checks both the skill definition files and any configuration templates shipped with the skill.
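Pattern-based detection of this kind can be sketched with a few regular expressions plus a Shannon-entropy check for generic secrets. The patterns below are illustrative assumptions, not Aguara's actual rules:

```python
import math
import re

# Hypothetical patterns illustrating the kinds of rules a scanner might use;
# these are NOT Aguara's actual detection rules.
PATTERNS = {
    "aws_access_key": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "github_pat": re.compile(r"\bghp_[A-Za-z0-9]{36}\b"),
    "private_key_block": re.compile(r"-----BEGIN (?:RSA |EC |OPENSSH )?PRIVATE KEY-----"),
}

def shannon_entropy(s: str) -> float:
    """Bits of entropy per character; random tokens score high."""
    if not s:
        return 0.0
    counts = {c: s.count(c) for c in set(s)}
    return -sum((n / len(s)) * math.log2(n / len(s)) for n in counts.values())

def scan_text(text: str) -> list[str]:
    """Return the names of all credential patterns found in the text."""
    findings = [name for name, pat in PATTERNS.items() if pat.search(text)]
    # Flag long, high-entropy tokens that look like secrets even when
    # they match no known provider format.
    for token in re.findall(r"[A-Za-z0-9+/=_-]{32,}", text):
        if shannon_entropy(token) > 4.5:
            findings.append("high_entropy_string")
            break
    return findings
```

The entropy threshold is a tuning knob: genuine secrets are near-random, so they score close to the maximum bits per character for their alphabet, while ordinary identifiers score much lower.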

Why this matters for AI agents

When an AI agent loads a skill, it ingests the full description and tool definitions into its context. If credentials are embedded in that text, the agent now "knows" those secrets. Depending on the agent's design, those secrets might appear in logs, get sent to other MCP servers the agent is connected to, or leak through the agent's responses to the user.

The blast radius is different from a leaked credential in a traditional codebase. A hardcoded API key in a Python file on GitHub might get caught by GitHub's secret scanning. But a credential embedded in a skill description on a smaller registry might sit undetected for months, silently loaded into thousands of agent sessions.

There is also the social engineering angle. A skill could instruct the agent to ask the user for credentials, then store or transmit them. This is harder to detect with pattern matching alone, which is why the NLP credential-exfiltration combo rule exists as a cross-category detection.

Real-world examples

A database management skill ships with a default configuration that includes the author's database URL and auth token. The author forgot to replace them with placeholders. Anyone who reads the skill definition (or any agent that loads it) now has read-write access to the author's production database.

A skill's source code contains an AWS access key and secret key pair, hardcoded as fallback values for when environment variables are not set. The skill author intended these as "development-only" credentials, but the AWS account has production S3 access. Automated scanners on other platforms do not catch it because the skill is not hosted on GitHub.

A "password manager" skill asks users to provide their master password through the tool input parameters. The skill description says it stores passwords locally, but the tool definition reveals it transmits inputs to an external API endpoint. The credential never stays local.

How to protect against it

Never put real credentials in skill definitions, configuration templates, or example code. Use placeholder strings like YOUR_API_KEY_HERE or reference environment variables exclusively. If your skill needs credentials to function, document which environment variables or secret stores it reads from, and let the user provide their own.
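A minimal sketch of the recommended pattern: the skill reads its credential from the environment and fails loudly if the user has not provided one, instead of falling back to a hardcoded value. The variable name SKILL_API_KEY is a hypothetical example; use whatever your skill documents:

```python
import os

PLACEHOLDER = "YOUR_API_KEY_HERE"  # safe to ship in docs and config templates

def get_api_key() -> str:
    """Read the user's own credential from the environment at runtime."""
    # SKILL_API_KEY is a hypothetical variable name for this sketch.
    key = os.environ.get("SKILL_API_KEY", "")
    if not key or key == PLACEHOLDER:
        raise RuntimeError(
            "SKILL_API_KEY is not set. Export your own key; "
            "this skill ships no credentials."
        )
    return key
```

Treating the unset (or still-placeholder) case as a hard error is deliberate: a silent fallback credential is exactly how "development-only" keys end up in production.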

Before publishing a skill, run a secret scanner on your entire project directory. Tools like trufflehog, gitleaks, or Aguara itself will catch most patterns. Make this part of your CI pipeline so credentials cannot sneak in through a careless commit.
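As a sketch, a pre-publish check can walk the project tree and report every matching line; a CI step would then fail the build when the result is non-empty. The patterns here are illustrative only; dedicated scanners like trufflehog and gitleaks ship far larger rule sets:

```python
import re
from pathlib import Path

# Illustrative patterns only; real scanners cover many more formats.
SECRET_PATTERNS = [
    re.compile(r"\bAKIA[0-9A-Z]{16}\b"),                # AWS access key ID
    re.compile(r"\bghp_[A-Za-z0-9]{36}\b"),             # GitHub personal access token
    re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----"),  # PEM private key header
]

def scan_project(root: str) -> list[tuple[str, int]]:
    """Return (path, line_number) for every line matching a secret pattern."""
    hits = []
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        try:
            lines = path.read_text(errors="ignore").splitlines()
        except OSError:
            continue  # unreadable file (permissions, broken symlink)
        for lineno, line in enumerate(lines, start=1):
            if any(p.search(line) for p in SECRET_PATTERNS):
                hits.append((str(path), lineno))
    return hits
```

In CI, call scan_project on the repository root and exit non-zero if it returns any hits, so a careless commit is blocked before publication.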

If you are evaluating skills to install, check whether the skill requests credentials through its input parameters rather than environment variables. A well-designed skill never handles raw credentials in its tool interface. It reads them from the environment at runtime, outside the agent context window.
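One quick heuristic when reviewing a skill is to check its tool input schema for parameter names that suggest raw credential handling. This sketch assumes an MCP-style JSON tool definition with an inputSchema field; the name list is an assumption, not an exhaustive check:

```python
# Parameter names that suggest a tool wants raw credentials in its interface.
SUSPECT_NAMES = {"password", "api_key", "apikey", "token", "secret",
                 "private_key", "master_password", "credentials"}

def credential_params(tool_definition: dict) -> list[str]:
    """Return input parameters whose names look like raw credentials."""
    props = tool_definition.get("inputSchema", {}).get("properties", {})
    return [name for name in props
            if any(s in name.lower() for s in SUSPECT_NAMES)]

# Example: a suspicious "password manager" tool definition (hypothetical).
tool = {
    "name": "store_password",
    "inputSchema": {
        "type": "object",
        "properties": {
            "site": {"type": "string"},
            "master_password": {"type": "string"},
        },
    },
}
```

A non-empty result is a reason to read the skill's source before installing: a well-designed skill's interface should carry no credential-shaped parameters at all.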

Aguara detection rules (12)

HIGH
AWS access key CRED_002

Detects AWS access key IDs

CRITICAL
GitHub personal access token CRED_003

Detects GitHub personal access tokens

CRITICAL
Private key block CRED_005

Detects PEM-encoded private keys

HIGH
Database connection string CRED_006

Detects database connection strings with credentials

MEDIUM
Slack or Discord webhook CRED_008

Detects Slack and Discord webhook URLs

CRITICAL
GCP service account key CRED_009

Detects Google Cloud Platform service account credentials

MEDIUM
JWT token CRED_010

Detects JWT tokens

MEDIUM
Credential in shell export CRED_011

Detects API keys or tokens exposed in shell export commands

HIGH
Stripe API key CRED_012

Detects Stripe live and test API keys

MEDIUM
CLI credential flags CRED_015

Detects command-line flags passing credentials with real-looking values

MEDIUM
SSH private key in command CRED_016

Detects SSH or SCP commands referencing private key files

LOW
Docker environment credentials CRED_017

Detects Docker or docker-compose commands passing credentials via environment variables

Want to check if your skills have credential leak issues?

Scan now (free, runs in your browser)