Item detail

zimingttkx/QuantumFlow

QuantumFlow zimingttkx/QuantumFlow is an MIT distributed LLM inference scheduling platform that turns a GPU cluster into a multi-user, multi-model, multi-business inference service, ships five entry points (REST / gRPC / Python SDK / CLI / Web Playground), three scheduling strategies that auto-switch (Gang / All-or-Nothing for large models, Pack / shared for small models, Adaptive / AI-selected),

Score7.8
Popularity188.0
Risklow
TierSilver
Score breakdown
Usefulness8.2
Novelty9.8
Momentum10.0
Maturity8.2
Open-source/build7.4
Evidence7.2
Workflow potential8.5
Setup ease6.5

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for ML infrastructure engineers, AI engineers, inference engineers, platform engineers, ops engineers, and teams that run a multi-GPU cluster and want one inference platform that supports multiple backends, multiple users, multiple models, and per-tenant billing, because zimingttkx/QuantumFlow is an MIT distributed LLM inference scheduling platform that turns a GPU cluster into a multi-user

Who should use it

BuildersPower users

Who should skip it

Skip if the source link, docs, or setup requirements do not match your workflow.

Risk explanation

Risk label needs manual review.

Evidence links

Closest alternatives / related signals