Item detail

gy15901580825/Argus

Argus gy15901580825/Argus is an Apache-2.0 black-box red-team testing harness for AI agents — point Argus at any HTTP, gRPC, or browser-using agent endpoint, run 167 adversarial probes (10 hand-authored OWASP LLM Top 10, 5 from public LLM system cards including best-of-N, crescendo, confused-deputy, many-shot jailbreak, sleeper-agent, 30+ browser-agent-specific, Semia-mapped agent-skill detectors

Score7.9
Popularity184.0
Riskmedium
TierSilver
Score breakdown
Usefulness8.3
Novelty9.7
Momentum10.0
Maturity8.2
Open-source/build7.4
Evidence7.2
Workflow potential8.6
Setup ease6.5

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for AppSec and AI-security teams who need a black-box red-team harness for production agent endpoints, security engineering teams that need SARIF output that drops straight into GitHub Code Scanning as a CI gate, AI product teams that need to map findings to OWASP LLM Top 10, MITRE ATLAS, and NIST AI RMF controls without translation, agent builders who need to validate browser-using agents

Who should use it

BuildersPower users

Who should skip it

Skip or sandbox it if you cannot review permissions, data access, and failure modes before use.

Risk explanation

Medium risk: use sandboxing, least privilege, and explicit review before connecting sensitive data or accounts.

Evidence links

Closest alternatives / related signals