Setup MCP scanner

Behavioural scan of every Model Context Protocol server an agent might call. DefenseClaw wraps cisco-ai-mcp-scanner to surface hidden tool intents and shadow capabilities, and writes verdicts into the same mcp_actions admission policy as the watcher.

The Model Context Protocol (MCP) is becoming the default way for agents to acquire new tools. Each MCP server exposes a set of tools; the agent picks them up at startup and calls them as needed.

That tool list is also a perfect place to hide a "shadow capability": a tool whose description claims to read calendar events but whose implementation actually exfiltrates files, or a safe_lookup tool that quietly modifies a database. Static metadata isn't enough; you need to look at both what the server says it can do and what its implementations imply.

DefenseClaw integrates Cisco's open-source cisco-ai-mcp-scanner for exactly that. Verdicts feed the mcp_actions admission policy by severity bucket — same shape as skill_actions.

Guided example · Synthetic local stdio server

Catch hidden side effects in an MCP tool before admission

A read-only-looking tool advertises filesystem and outbound-network effects, so policy disables runtime and blocks installation.

Deterministic

{
  "server": "catalog-lookup",
  "transport": "local_stdio",
  "severity": "high",
  "finding": "claimed_intent_side_effect_mismatch",
  "runtime": "disabled",
  "admission": "blocked"
}

Decision

Disable runtime and block install

Reason

HIGH capability mismatch

Action

Write admission audit event

What DefenseClaw did — and did not do

What it did

Inspect a local stdio server in the scanner sandbox
Compare claimed intent with descriptor side effects
Apply policy to the whole server

What it did not do

Add a remote URL to any connector
Claim a clean scan proves harmless implementation
Require optional LLM intent analysis

What you just saw

A locally configured stdio server advertised a read-only-looking tool whose description and schema implied filesystem and outbound-network side effects. DefenseClaw held admission, inspected the server in the scanner sandbox, resolved mcp_actions.high, disabled the runtime, and blocked installation. Remote URL scans use a different path and do not add the server to a connector.

What it scans

The wrapper at cli/defenseclaw/scanner/mcp.py accepts two target shapes — both as a positional TARGET argument (there is no --server or --remote flag — the scanner figures out which from the target's shape):

Local stdio server name — described by a JSON file with a top-level mcpServers block (the format used by Claude Desktop, Cursor, OpenClaw, Gemini, Codex, and Copilot). DefenseClaw spins each server up in a sandbox, lists its tools, and records every advertised description, parameter schema, and example.
Remote HTTP server URL — passed as the positional target. The scanner calls scan_remote_server_tools directly without spawning a subprocess.
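
To make the "target's shape" idea concrete, here is a minimal sketch of how a positional target could be classified. It is an illustration only, not the code in cli/defenseclaw/scanner/mcp.py; the function name classify_target and the "remote_http" label are hypothetical, while "local_stdio" is the transport value shown in the guided example.

```python
from urllib.parse import urlparse


def classify_target(target: str) -> str:
    """Classify a positional TARGET as a remote URL or a local server name.

    Hypothetical helper: the real wrapper may use different rules.
    """
    parsed = urlparse(target)
    # Treat only well-formed http(s) URLs with a host as remote targets.
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "remote_http"
    # Anything else is assumed to be a server name from a discovered config.
    return "local_stdio"
```

Under these assumptions, https://mcp.example.com/sse is treated as a remote target and filesystem as a local stdio server name.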

For every tool the scanner produces:

A static signature (name, schema, description hash).
An optional LLM-assisted intent analysis — the description vs. the implied side effects.
A consolidated finding with severity and reasoning.
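
A static signature of this kind can be sketched as follows. This is an assumption about one reasonable way to build it, not the scanner's actual format: the field names schema_sha256 and description_sha256 and the choice of SHA-256 are hypothetical.

```python
import hashlib
import json


def static_signature(name: str, description: str, schema: dict) -> dict:
    """Build a stable fingerprint for one tool (illustrative sketch only)."""
    # Canonicalise the schema so key order does not change the hash.
    canonical_schema = json.dumps(schema, sort_keys=True, separators=(",", ":"))
    return {
        "name": name,
        "schema_sha256": hashlib.sha256(canonical_schema.encode("utf-8")).hexdigest(),
        "description_sha256": hashlib.sha256(description.encode("utf-8")).hexdigest(),
    }
```

Comparing signatures between scans would show when a server changes a tool's description or schema without changing its name.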

Where DefenseClaw finds your servers

The CLI auto-discovers MCP server lists from the connectors you've set up, so you usually don't pass paths:

defenseclaw mcp scan --all
defenseclaw mcp scan --connector cursor

The default sources include:

~/.openclaw/openclaw.json → mcp.servers
~/Library/Application Support/Claude/claude_desktop_config.json → mcpServers
~/.cursor/mcp.json → mcpServers
~/.codeium/windsurf/mcp_config.json → mcpServers
~/.gemini/config/mcp_config.json and <workspace>/.agents/mcp_config.json → mcpServers (Antigravity)
Connector-specific paths registered by defenseclaw setup guardrail.
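
The "file → key" pairs above can be read as a config path plus a dotted key path. The sketch below shows one way to load the server table under that reading; load_servers is a hypothetical helper, not a DefenseClaw API, and it simply returns an empty mapping when the file or key is missing.

```python
import json
from pathlib import Path


def load_servers(config_path: str, key_path: str) -> dict:
    """Return the server table at key_path (e.g. 'mcpServers' or 'mcp.servers').

    Hypothetical helper for illustration; missing files or keys yield {}.
    """
    try:
        node = json.loads(Path(config_path).expanduser().read_text(encoding="utf-8"))
    except (OSError, json.JSONDecodeError):
        return {}
    # Walk the dotted path one key at a time.
    for key in key_path.split("."):
        if not isinstance(node, dict) or key not in node:
            return {}
        node = node[key]
    return node if isinstance(node, dict) else {}
```

For example, load_servers("~/.cursor/mcp.json", "mcpServers") and load_servers("~/.openclaw/openclaw.json", "mcp.servers") would each return a name-to-definition mapping if the files exist.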

Each server is scanned independently and findings are tagged with the connector that discovered them. Use --connector <name> on list, scan, set, unset, block, allow, and unblock commands when you want one connector's MCP source instead of the full configured roster. Without --connector, mcp set writes the server into every configured connector's MCP source, and mcp unset removes it from every configured connector that has it.

One-shot scans

defenseclaw mcp scan --all --json | jq -s '[.[].findings[]]'

CI-friendly. Walks every server discovered across all configured connectors' config files — not just openclaw.json, but every connector's MCP source (claude_desktop_config.json, ~/.cursor/mcp.json, Codex, Copilot, …) — and emits one JSON object per server as a stream of top-level values (NDJSON-shaped, not wrapped in an array). jq -s slurps the stream into an array so a single pipeline can iterate findings across servers; for line-oriented tools use --json | jq -c '.findings[]' instead.
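
If you would rather consume the stream from a script than from jq, the same slurp-and-flatten step can be sketched in Python. The only field assumed here is the top-level findings list that the jq filter above already relies on; the helper names are hypothetical.

```python
import json
from typing import Iterator


def iter_json_values(text: str) -> Iterator[dict]:
    """Yield each top-level JSON value from a concatenated stream."""
    decoder = json.JSONDecoder()
    pos = 0
    while pos < len(text):
        # raw_decode does not skip leading whitespace, so do it here.
        while pos < len(text) and text[pos].isspace():
            pos += 1
        if pos >= len(text):
            break
        value, pos = decoder.raw_decode(text, pos)
        yield value


def all_findings(text: str) -> list:
    """Flatten the findings of every per-server report into one list."""
    return [f for report in iter_json_values(text) for f in report.get("findings", [])]
```

In CI this could be fed with the captured stdout of defenseclaw mcp scan --all --json. Because it decodes value by value, it also tolerates reports that span several lines.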

defenseclaw mcp scan filesystem

Scans just the named server entry from the discovered configs. The name is the positional target, not a --server flag.

defenseclaw mcp scan https://mcp.example.com/sse

Pass the URL as the positional target — the scanner detects it's an HTTP target and uses scan_remote_server_tools. No subprocess spawn. The remote server is not added to any connector — this is purely a pre-flight check.

defenseclaw mcp scan filesystem --scan-prompts --scan-resources --scan-instructions

Extends the scan beyond the tool list to also evaluate the server's prompts, resources, and instructions surfaces. Use when you suspect a server is hiding intent in its prompt templates.

Other real flags: --analyzers <list> to constrain which analyzer plugins run, --json for machine output. The default analyzer setting is auto, which lets DefenseClaw select the scanner plugins available in the installed cisco-ai-mcp-scanner version and enabled by your credentials. To opt out of a plugin, pass an explicit comma-separated list that omits it; explicit lists are honored as-is rather than being expanded back to auto.

Continuous protection

Just like skills, MCP scanning is most valuable as part of an admission pipeline. defenseclaw setup guardrail configures the watcher to:

Block: when a new entry appears in any tracked mcpServers file, hold the agent off until the scan resolves.

Scan: run the wrapper against the new entry and emit the findings to the audit pipeline.

Allow: release the server back to the agent if the result severity maps to runtime: enable for the active rule pack.
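
The hold, scan, and release sequence can be pictured as a small state walk. The sketch below is an assumption-laden illustration, not the watcher's implementation: admit, the scan callable, and the state labels are all hypothetical, and unknown severities are treated as fail-closed by choice.

```python
from typing import Callable, Dict, List


def admit(server: str, scan: Callable[[str], str], runtime_by_severity: Dict[str, str]) -> List[str]:
    """Return the ordered states one new server entry passes through."""
    states = ["blocked"]                      # hold the agent off first
    severity = scan(server)                   # hypothetical stand-in for the scanner wrapper
    states.append(f"scanned:{severity}")
    # Release only when the severity maps to runtime: enable.
    if runtime_by_severity.get(severity) == "enable":
        states.append("allowed")
    else:
        states.append("held")                 # assumed fail-closed default
    return states
```

With a mapping such as {"high": "disable", "low": "enable"}, a server scanned as low is released and one scanned as high stays held.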

If you only want one-shot scans (no watcher), keep setup guardrail's admission off and run defenseclaw mcp scan --all from CI or a daily cron. The findings still flow to the same audit sinks.

Configure the action mapping

The scanner produces a severity. What that severity does lives in the active OPA policy file under mcp_actions — the schema is per-severity buckets, identical in shape to skill_actions:

policies/default.yaml (excerpt)

admission:
  scan_on_install: true
  allow_list_bypass_scan: true

mcp_actions:
  critical:
    file:    quarantine
    runtime: disable
    install: block
  high:
    file:    quarantine
    runtime: disable
    install: block
  medium:
    file:    none
    runtime: enable
    install: none
  low:
    file:    none
    runtime: enable
    install: none
  info:
    file:    none
    runtime: enable
    install: none

# Optional per-asset overrides (in scanner_overrides), e.g. tighten medium for the mcp class:
scanner_overrides:
  mcp:
    medium:
      file: quarantine
      runtime: disable
      install: block
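
To see how a severity turns into actions, the sketch below resolves a bucket from the table in the excerpt and then layers a per-class override on top. The merge rule (override keys replace the bucket's keys for that severity) is an assumption about how scanner_overrides behaves, and resolve_actions is a hypothetical name, not part of DefenseClaw.

```python
# Values copied from the policies/default.yaml excerpt.
MCP_ACTIONS = {
    "critical": {"file": "quarantine", "runtime": "disable", "install": "block"},
    "high": {"file": "quarantine", "runtime": "disable", "install": "block"},
    "medium": {"file": "none", "runtime": "enable", "install": "none"},
    "low": {"file": "none", "runtime": "enable", "install": "none"},
    "info": {"file": "none", "runtime": "enable", "install": "none"},
}

SCANNER_OVERRIDES = {
    "mcp": {"medium": {"file": "quarantine", "runtime": "disable", "install": "block"}},
}


def resolve_actions(severity, asset_class="mcp", actions=MCP_ACTIONS, overrides=None):
    """Return the file/runtime/install actions for one severity (sketch)."""
    key = severity.lower()
    bucket = dict(actions[key])  # copy so the base table is never mutated
    # Assumed semantics: a class-level override replaces matching keys.
    bucket.update((overrides or {}).get(asset_class, {}).get(key, {}))
    return bucket
```

Under these assumptions, a medium finding keeps the runtime enabled by default but disables it once the mcp override is supplied.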

To switch profiles for the entire mcp_actions table:

defenseclaw policy activate strict       # see Defaults page for the diff vs default
defenseclaw policy activate default
defenseclaw policy activate permissive

Block / allow individual servers

The watcher's verdict is the default. Operators can always override it manually (block / allow act on the whole server, not on individual tools; the CLI doesn't take a --tool flag):

defenseclaw mcp list                                         # what is configured + state across configured connectors
defenseclaw mcp list --connector opencode                    # one connector's MCP source
defenseclaw mcp set context7 --command uvx --args context7-mcp
defenseclaw mcp set context7 --command uvx --args context7-mcp --connector opencode
defenseclaw mcp unset context7 --connector opencode
defenseclaw mcp block untrusted-fs --reason "exfil tool found"
defenseclaw mcp allow cisco-internal --reason "first-party"

Every action is audited (block-mcp, etc.) so the trail is intact even when the manual override contradicts the watcher.

Using the unified LLM key

Behavioural intent analysis depends on the LLM judge being able to reason about each tool's description vs. its parameter schema. Set the unified key once:

defenseclaw keys set DEFENSECLAW_LLM_KEY

DefenseClaw injects it into the scanner subprocess via inject_llm_env, alongside any Cisco AI Defense API key for content classification. See Setup → Unified LLM key for the resolution order and Bifrost provider catalog.

The MCP scanner runs each stdio server inside a subprocess sandbox with a wall-clock timeout. Servers that hang are killed; servers that try to spawn networked side processes are flagged as findings, not just terminated silently.
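
The wall-clock timeout part of that behaviour can be sketched with the standard library. This shows only the deadline handling, not any isolation or side-process detection, and run_with_deadline and its status labels are hypothetical rather than the scanner's own.

```python
import subprocess
import sys


def run_with_deadline(argv, timeout_s):
    """Run a child process and kill it if it exceeds timeout_s seconds."""
    try:
        completed = subprocess.run(
            argv, capture_output=True, text=True, timeout=timeout_s
        )
    except subprocess.TimeoutExpired:
        # subprocess.run kills and reaps the child before raising.
        return {"status": "killed_timeout", "returncode": None}
    return {"status": "exited", "returncode": completed.returncode}


if __name__ == "__main__":
    # A child that sleeps far past the deadline stands in for a hung server.
    print(run_with_deadline([sys.executable, "-c", "import time; time.sleep(30)"], 0.5))
```

Reporting the timeout as a result instead of raising lets the caller record it as a finding.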

Install if missing

cisco-ai-mcp-scanner is a hard dependency of defenseclaw (Python ≥3.11) and is pulled in automatically by pip install defenseclaw. If your environment somehow lost it, reinstall it directly:

pip install --upgrade cisco-ai-mcp-scanner

defenseclaw mcp exits cleanly with a one-line install hint if the SDK isn't present — it never crashes the gateway.