Score breakdown
Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.
Why it matters
Useful for security teams and engineering teams that need an autonomous vulnerability discovery + remediation loop and want Anthropic's reference implementation as a starting point (not a vendor lock-in — the harness is configurable for any Claude API, including Bedrock, Vertex, and Azure): the reference pipeline runs in a gVisor sandbox by default so the autonomous patch step can execute target c
Who should use it
Who should skip it
Skip anthropics/defending-code-reference-harness for now if your priority is a tool you can use today without configuring a build pipeline or development environment.
About this signal
anthropics/defending-code-reference-harness is tracked by RepoRadar as a anthropic's reference harness fo in the Apache-2.0 reference implementation from Anthrop section. It was first seen on 2026-06-25 and last updated on 2026-06-25. The current verdict is 'try now' with a Gold tier and hard setup difficulty. anthropics/defending-code-reference-harness leads on workflow potential (9.5) and practical usefulness (9.0); its lowest signal is setup ease (4.2), so factor that in before investing setup time. This page summarizes the evidence RepoRadar has captured from captured source metadata. The score, tier, risk label, and verdict on this page are never influenced by sponsorship, ads, or tips — they reflect only the usefulness, popularity, novelty, momentum, maturity, and evidence signals described in the RepoRadar methodology.
How this item is evaluated
RepoRadar assigned anthropics/defending-code-reference-harness a composite score of 8.4 out of 10, placing it in the Gold tier. This score combines weighted sub-signals: usefulness (35%), novelty (18%), momentum (14%), maturity (10%), open-source/build quality (7%), evidence quality (6%), workflow potential (6%), and setup ease (4%). Popularity is tracked separately at 6199.0 and never affects the composite score or tier. The risk label of 'low' reflects inherent user-impacting hazards, not generic novelty. Items with no risk flag may still require normal code review before production use.
Risk explanation
**Reference, not a product; the harness will not work on every codebase out of the box.** The README is explicit: 'This harness is a reference, not a product. The general shape, prompts, and sandboxing are reusable, but the harness will not work on every codebase out of the box.' Run `/customize` to port it to the team's language, detector, or vuln class. Adopters who want a turnkey product should evaluate Claude Security (the managed option Anthropic ships) instead; **Autonomous pipeline executes target code; gVisor sandbox is the security boundary.** The autonomous reference pipeline (including `/patch` on pipeline results) executes target code, so it refuses to run outside of a gVisor sandbox unless explicitly overridden. The `scripts/setup_sandbox.sh` helper sets up the sandbox once, then the pipeline is invoked via `bin/vp-sandboxed`. Adopters who skip the sandbox setup and run `bin/vp` (or whatever the unsandboxed entry point is) directly are running the patch step on the host without the gVisor boundary — read `docs/security.md` and `docs/agent-sandbox.md` before overriding; **Repo is not maintained; fork, customize, and run your own pipeline.** The README is explicit that 'this repo is not maintained and is not accepting contributions.' Adopters should treat the patterns as the product, fork the repo, customize it for their own language / detector / vuln class, and run their own pipeline on the shape Anthropic ships. Bugs and security issues in the reference are not getting fixes; the team's port is on the team.