Score breakdown
Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.
Why it matters
Useful for AI research teams, agent benchmark authors, and game-AI engineers who need an Apache-2.0-licensed end-to-end agent benchmark that asks whether AI agents can build playable games in a real game engine, so they can reproduce the same scoring on any agent or model release without re-deriving the test harness.
Who should use it
Who should skip it
Skip if the source link, docs, or setup requirements do not match your workflow.
Risk explanation
It is a research benchmark that requires a real game engine environment to run, so review the engine requirements, confirm the scoring harness matches the agent or model release you want to test, and verify the reproducible data is current before publishing comparative results.