Score breakdown
Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.
Why it matters
Useful for agent teams that ship a directory of skills and want a regression suite: pin the benchmark version, add it to CI for the agent under test, and treat drops in skill-level scores as build failures the same way you would for unit tests.
Who should use it
Who should skip it
Skip if the source link, docs, or setup requirements do not match your workflow.
Risk explanation
benchmark scores can be gamed; pair SkillsBench with at least one real user task in your CI.