SkillRadar
Agent security + benchmarks
Trust Report v1

openclaw-qa-testing

Run, watch, debug, extend, or explain OpenClaw qa-lab and qa-channel scenarios, artifacts, and live lanes.

Overall: 63
Trust: 29
Utility: 88
Momentum: 95

Install caution

High-risk behavior present

Risk: High

Source: OpenClaw Built-in Skills

Path: .agents/skills/openclaw-qa-testing/SKILL.md

Review flags: browser/session access, credential or secret references, filesystem/home-directory access, network access or external URLs. These are review signals, not definitive security judgments; inspect before installing.

Required permissions

  • Environment variables / secrets
  • Shell commands
  • Network/API usage
  • Filesystem/home access
  • Browser/session access

Permissions are inferred from SKILL.md text only. They are review prompts, not guarantees about runtime behavior.

Risk flags explained

browser_or_session_access (medium)

Mentions browser automation, cookies, sessions, local storage, or browser state.

credential_or_secret_reference (high)

Mentions tokens, API keys, passwords, or private-key style environment variables.

filesystem_write_or_home_access (medium)

Mentions filesystem writes, deletes, home-directory paths, or config/key locations.

network_access (medium)

Mentions external URLs, network APIs, downloads, or HTTP client usage.

shell_command (medium)

Contains shell command snippets. Review commands before copy/paste or agent execution.

Score explanation

Trust

  • Trust starts at 90 before review-signal penalties and metadata bonuses.
  • Risk-signal penalty: -63 from 5 detected flags.
  • Metadata bonus: +2 from author/version/description fields.
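
The Trust arithmetic above can be checked in a few lines. This is a minimal sketch, assuming the penalty and bonus are simply added to the starting value; the function name `trust_score` is hypothetical, and the report does not say how the -63 penalty is split across the five flags.

```python
def trust_score(base: int, penalty: int, bonus: int) -> int:
    """Add a (negative) penalty and a bonus to the starting trust value.

    Assumption: plain addition, with no clamping or per-flag weighting.
    """
    return base + penalty + bonus


# Values taken from the report: start 90, penalty -63, bonus +2.
trust = trust_score(base=90, penalty=-63, bonus=2)
print(trust)  # 29
```

The result matches the Trust score of 29 shown at the top of the report.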

Utility

  • Utility starts at 55 and rewards clear descriptions, runnable examples, and explicit setup needs.
  • Description present: yes.
  • Command examples detected: 13.
  • Environment variables detected: 6.

Momentum

  • Momentum starts at 45 and uses public repo activity signals.
  • Recent commit activity: latest repo update was 0 days ago.
  • Recent commit volume: 100 commits in the lookback window (+20).
  • Source has strong public adoption: 368,598 stars.
  • Fork activity suggests reuse: 75,946 forks.

Overall

  • Overall score weights trust 45%, utility 35%, and momentum 20%.
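
The weighting above can be reproduced as a weighted sum of the three sub-scores. This is a sketch under stated assumptions: the names `WEIGHTS` and `overall_score` are hypothetical, and rounding to the nearest integer is assumed because the report only shows whole-number scores.

```python
# Weights as stated in the report: trust 45%, utility 35%, momentum 20%.
WEIGHTS = {"trust": 0.45, "utility": 0.35, "momentum": 0.20}


def overall_score(scores: dict[str, float]) -> int:
    """Weighted sum of the sub-scores, rounded to the nearest integer (assumed)."""
    weighted = sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)
    return round(weighted)


# Sub-scores from this report.
scores = {"trust": 29, "utility": 88, "momentum": 95}
print(overall_score(scores))  # 63
```

With these inputs the weighted sum is about 62.85, which rounds to the Overall score of 63 shown in the report.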

Detected signals

Env vars

  • OPENAI_API_KEY
  • OPENCLAW_LIVE_OPENAI_KEY
  • OPENCLAW_QA_CONVEX_SECRET_CI
  • OPENCLAW_QA_CONVEX_SECRET_MAINTAINER
  • OPENCLAW_QA_TELEGRAM_DRIVER_BOT_TOKEN
  • OPENCLAW_QA_TELEGRAM_SUT_BOT_TOKEN

Commands

  • gh api repos/openclaw/openclaw/actions/runs/<run-id>/artifacts
  • gh run view --json artifacts
  • gh workflow run "NPM Telegram Beta E2E" --repo openclaw/openclaw --ref main -f package_spec=openclaw@YYYY.M.D-beta.N -f package_label=openclaw@YYYY.M.D-beta.N -f provider_mode=mock-openai
  • openclaw-qa
  • pnpm openclaw qa character-eval --model openai/gpt-5.4,thinking=xhigh --model openai/gpt-5.2,thinking=xhigh --model openai/gpt-5,thinking=xhigh --model anthropic/claude-opus-4-6,thinking=high --model anthropic/claude-sonnet-4-6,thinking=high --model zai/glm-5.1,thinking=high --model moonshot/kimi-k2.5,thinking=high --model google/gemini-3.1-pro-preview,thinking=high --judge-model openai/gpt-5.4,thinking=xhigh,fast --judge-model anthropic/claude-opus-4-6,thinking=high --concurrency 16 --judge-concurrency 16 --output-dir .artifacts/qa-e2e/character-eval-<tag>
  • pnpm openclaw qa manual --model codex-cli/<codex-model> --message "Reply exactly: CODEX_OK"
  • pnpm openclaw qa matrix
  • pnpm openclaw qa matrix --profile fast --fail-fast
  • pnpm openclaw qa suite --provider-mode live-frontier --model codex-cli/<codex-model> --alt-model codex-cli/<codex-model> --scenario <scenario-id> --output-dir .artifacts/qa-e2e/codex-<tag>
  • pnpm openclaw qa suite --provider-mode live-frontier --model openai/gpt-5.4 --alt-model openai/gpt-5.4 --output-dir .artifacts/qa-e2e/run-all-live-frontier-<tag>
  • pnpm qa:otel:smoke
  • pnpm test:docker:npm-telegram-live

URLs

  • http://127.0.0.1:<port

Methodology note

SkillRadar scans SKILL.md as hostile text only. It does not execute commands, install packages, or load third-party skills.
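
To illustrate what a text-only scan might look like, the sketch below pulls secret-style environment variable names out of a string using a regular expression. Nothing in it is executed or installed. This is not SkillRadar's actual detector: the pattern, the `SECRET_PARTS` set, and the function name `find_secret_env_vars` are all assumptions made for illustration.

```python
import re

# Upper-case identifiers with at least one underscore, e.g. OPENAI_API_KEY.
ENV_NAME = re.compile(r"\b[A-Z][A-Z0-9]*(?:_[A-Z0-9]+)+\b")

# Hypothetical name parts that suggest a credential or secret.
SECRET_PARTS = {"KEY", "TOKEN", "SECRET", "PASSWORD"}


def find_secret_env_vars(text: str) -> list[str]:
    """Return sorted, de-duplicated env-var-like names that contain a secret-style part."""
    names = set(ENV_NAME.findall(text))
    return sorted(n for n in names if SECRET_PARTS & set(n.split("_")))


sample = (
    "Set OPENAI_API_KEY and OPENCLAW_QA_TELEGRAM_SUT_BOT_TOKEN, "
    'then run: pnpm openclaw qa manual --message "Reply exactly: CODEX_OK"'
)
print(find_secret_env_vars(sample))
# ['OPENAI_API_KEY', 'OPENCLAW_QA_TELEGRAM_SUT_BOT_TOKEN']
```

Note that `CODEX_OK` matches the identifier pattern but is dropped because none of its parts is in `SECRET_PARTS`. A match here is only a review prompt, in the same sense as the flags in this report.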