evaluating-llms-harness

skills-sh:davila7_claude-code-templates__evaluating-llms-harness

Grade: A
Score: 92/100
First Seen: Feb 18, 2026
Last Scanned: Feb 20, 2026
Findings: 5 (MEDIUM 1, LOW 4)

Findings (5)

MEDIUM
Dynamic code evaluation
L293

Detects eval() or exec() used for dynamic code execution

eval (t
FIX

Replace eval()/exec() with a safer alternative such as json.loads(), ast.literal_eval(), or a purpose-built parser.
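
As a rough illustration of the suggested fix, the sketch below parses a text value with json.loads() first and falls back to ast.literal_eval(), so no code is ever executed. The helper name parse_untrusted_value is hypothetical and does not come from the scanned skill; it assumes the flagged call only needs to turn text into plain data.

```python
import ast
import json


def parse_untrusted_value(text: str):
    """Turn text into plain data without executing it (hypothetical helper)."""
    try:
        # JSON covers the common case: objects, arrays, numbers, strings.
        return json.loads(text)
    except json.JSONDecodeError:
        pass
    # literal_eval accepts only Python literals (dict, list, tuple, str, ...).
    # Function calls, attribute access, and names raise ValueError instead of running.
    return ast.literal_eval(text)
```

Input such as `__import__('os').getcwd()` is rejected with ValueError rather than evaluated. If the original eval() call genuinely needs to run expressions, this swap is not sufficient and a purpose-built parser is the safer route.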

FP?

Likely FP if the matched text contains 'exec' as part of a word (e.g., 'execute', 'execution') rather than an actual eval() or exec() call.
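
One way to triage this kind of match is to check whether the text is a bare eval( or exec( call rather than part of a longer word or a method on another object. The pattern and the function name is_dynamic_eval_call below are assumptions made for illustration; this is not the scanner's actual rule.

```python
import re

# Match eval( / exec( only when not preceded by a word character or a dot,
# so "execute", "lm_eval (" and "model.eval()" are not treated as calls.
DYNAMIC_CALL = re.compile(r"(?<![\w.])(?:eval|exec)\s*\(")


def is_dynamic_eval_call(snippet: str) -> bool:
    """Return True if the snippet looks like a direct eval()/exec() call."""
    return DYNAMIC_CALL.search(snippet) is not None
```

A text-level check like this can still misfire inside strings or comments; parsing the file with the ast module and looking for Call nodes would be stricter.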

LOW
pip install arbitrary package
L17

Detects pip install of arbitrary packages that modify the host environment

pip install lm
FIX

Pin all pip packages to exact versions (e.g., pip install package==1.2.3). Use a requirements.txt or pyproject.toml with pinned versions and hash verification.
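
A small check along these lines can flag requirement lines that are not pinned to an exact version. It is a sketch under simplifying assumptions: the function name unpinned_requirements is hypothetical, option lines such as --hash are skipped rather than verified, and URL or editable requirements are not handled.

```python
import re

PINNED = re.compile(
    r"^[A-Za-z0-9][A-Za-z0-9._-]*"  # project name
    r"(\[[^\]]+\])?"                # optional extras, e.g. name[extra]
    r"\s*==\s*"                     # exact-version operator only
    r"[A-Za-z0-9.!+_-]+"            # version string without wildcards
    r"(?=\s|;|\\|$)"                # must end at space, marker, continuation, or EOL
)


def unpinned_requirements(lines):
    """Return the requirement lines that lack an exact == pin."""
    flagged = []
    for raw in lines:
        line = raw.strip()
        # Skip blanks, comments, and option lines such as --hash=sha256:...
        if not line or line.startswith("#") or line.startswith("-"):
            continue
        if not PINNED.match(line):
            flagged.append(line)
    return flagged
```

For example, given the lines `package==1.2.3` and `otherpkg>=2.0`, only the second is returned. Hash verification itself is enforced by pip when requirements carry --hash options and the install is run with --require-hashes.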

FP?

Likely FP if the match is in documentation showing how to install the skill's own PyPI package.

LOW
Shell script file execution
L323

Detects execution of shell script files via bash/sh command or direct invocation

./eval_checkpoint.sh
FIX

Replace direct shell script execution with a language-native implementation or a sandboxed executor. If shell scripts must run, restrict them to a vetted allowlist with integrity checks.
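
If a shell script has to run, the allowlist-plus-integrity-check idea could look roughly like the following. The function run_vetted_script and the shape of the allowlist (file name mapped to an expected SHA-256 hex digest) are assumptions for illustration; the script body is fed to bash on stdin so that the bytes executed are exactly the bytes that were hashed.

```python
import hashlib
import subprocess
from pathlib import Path


def run_vetted_script(path, allowlist, timeout=60):
    """Run a shell script only if its SHA-256 digest matches the allowlist.

    allowlist maps a script file name to its expected hex digest (hypothetical format).
    """
    script = Path(path).resolve()
    content = script.read_bytes()
    digest = hashlib.sha256(content).hexdigest()
    expected = allowlist.get(script.name)
    if expected is None or digest != expected:
        raise PermissionError(f"{script.name} is not on the allowlist or was modified")
    # "bash -s" reads the script from stdin, so the verified bytes are what runs.
    return subprocess.run(
        ["bash", "-s"],
        input=content,
        check=True,
        capture_output=True,
        timeout=timeout,
    )
```

Keying the allowlist by file name keeps the sketch short; keying by full resolved path would avoid collisions between scripts that share a name. This does not sandbox the script, it only confirms that the contents are the ones that were reviewed.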

FP?

Likely FP if the match references running a script that is part of the skill's own repository (e.g., ./setup.sh) with clear, auditable contents.

LOW
pip install arbitrary package
L742

Detects pip install of arbitrary packages that modify the host environment

pip install vl
FIX

Pin all pip packages to exact versions (e.g., pip install package==1.2.3). Use a requirements.txt or pyproject.toml with pinned versions and hash verification.

FP?

Likely FP if the match is in documentation showing how to install the skill's own PyPI package.

LOW
pip install arbitrary package
L877

Detects pip install of arbitrary packages that modify the host environment

pip install hu
FIX

Pin all pip packages to exact versions (e.g., pip install package==1.2.3). Use a requirements.txt or pyproject.toml with pinned versions and hash verification.

FP?

Likely FP if the match is in documentation showing how to install the skill's own PyPI package.