github.com

rzhub/GateMem

rzhub/GateMem is a benchmark toolkit in RepoRadar's Evals & Benchmarks section, holding Gold tier and a 'try now' verdict. Its strongest signal is workflow potential, scored 9.2 out of 10.

Score: 8.1
Popularity: 1.0
Risk: none
Tier: Gold
Score breakdown
Usefulness: 7.0
Novelty: 8.0
Momentum: 6.0
Maturity: 6.4
Open-source/build: 8.4
Evidence: 8.0
Workflow potential: 9.2
Setup ease: 6.4

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for researchers and agent-platform builders who need to measure whether a memory system is merely helpful or actually safe to share across users, teams, or sessions with different rights.

Who should use it

Researchers studying agent memory safety and governance
Builders designing shared-memory agent products for teams or multi-user workspaces
Evaluation teams that want a benchmark before trusting long-lived memory layers
Developers comparing memory architectures that claim to support deletion or scoped access

Who should skip it

Move on from rzhub/GateMem if the licensing terms, language support, or platform requirements do not fit your project.

About this signal

rzhub/GateMem is tracked by RepoRadar as a benchmark toolkit in the Evals & Benchmarks section. It was first seen and last updated on 2026-06-28. The current verdict is 'try now' with a Gold tier and moderate setup difficulty. The standout signals for rzhub/GateMem are workflow potential (9.2) and open-source/build quality (8.4), while momentum (6.0) trails; that balance shapes where it fits best. This page summarizes the evidence RepoRadar has captured from source metadata. The score, tier, risk label, and verdict on this page are never influenced by sponsorship, ads, or tips; they reflect only the usefulness, popularity, novelty, momentum, maturity, and evidence signals described in the RepoRadar methodology.

How this item is evaluated

RepoRadar assigned rzhub/GateMem a composite score of 8.1 out of 10, placing it in the Gold tier. This score combines weighted sub-signals: usefulness (35%), novelty (18%), momentum (14%), maturity (10%), open-source/build quality (7%), evidence quality (6%), workflow potential (6%), and setup ease (4%). Popularity is tracked separately at 1.0 and never affects the composite score or tier. The risk label covers inherent user-impacting hazards, not generic novelty, so 'none' means no such hazard was found in the captured evidence. Items with no risk flag may still require normal code review before production use.
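The weighting described above can be sketched as a plain weighted average. This is only an illustration under the assumption that the sub-signals combine linearly; the dictionary and function names are hypothetical and are not RepoRadar's code. With the sub-scores from the score breakdown on this page, a linear combination comes out at about 7.25 rather than the published 8.1, so the published composite presumably involves normalization or adjustments that this page does not describe.

```python
# Sub-scores as listed in the score breakdown on this page.
SUB_SCORES = {
    "usefulness": 7.0,
    "novelty": 8.0,
    "momentum": 6.0,
    "maturity": 6.4,
    "open_source_build": 8.4,
    "evidence": 8.0,
    "workflow_potential": 9.2,
    "setup_ease": 6.4,
}

# Weights as stated in the evaluation paragraph (they sum to 100%).
WEIGHTS = {
    "usefulness": 0.35,
    "novelty": 0.18,
    "momentum": 0.14,
    "maturity": 0.10,
    "open_source_build": 0.07,
    "evidence": 0.06,
    "workflow_potential": 0.06,
    "setup_ease": 0.04,
}


def weighted_average(scores, weights):
    """Plain weighted average (assumption: sub-signals combine linearly)."""
    total_weight = sum(weights.values())
    return sum(scores[name] * weights[name] for name in weights) / total_weight


# Popularity is deliberately absent: the page says it never affects the score.
composite = weighted_average(SUB_SCORES, WEIGHTS)
print(f"Linear weighted average: {composite:.2f}")
```

Usefulness carries the largest weight (35%), so its 7.0 pulls the linear result down even though workflow potential (9.2) is the highest single sub-score.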

Putting this into practice? Read "How to read AI benchmarks without getting fooled" for the checklist behind this score.

Risk explanation

No inherent user-impacting risk is flagged from the captured evidence.

Evidence links
Closest alternatives / related signals
evals, benchmark, agent-memory, research, mit, shared-memory