Item detail

noonghunna/club-3090

RepoRadar surfaced noonghunna/club-3090 — a rtx 3090/4090/5090 community llm — into the Apache-2.0 community recipes for serving LLMs on section, where it sits at Gold tier with a 'try now' verdict. Its strongest signal is workflow potential, scored 9.6 out of 10.

Score8.5
Popularity1468.0
Risklow
TierGold
Score breakdown
Usefulness9.0
Novelty8.0
Momentum8.0
Maturity9.1
Open-source/build8.4
Evidence7.2
Workflow potential9.6
Setup ease6.4

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for home-lab developers and small teams running one or two RTX 3090s (or 4090/5090) who want a measured, reproducible LLM-serving stack without picking vLLM vs llama.cpp vs ik_llama from first principles: club-3090 is the Apache-2.0 community recipes repo that ships validated Docker compose variants for each (vLLM for max throughput, llama.cpp for max context + cliff-immunity, ik_llama for

Who should use it

Home-lab developers and small teams running one or two RTX 3090s (or 4090/5090) who want a measured, reproducible LLM-serving stack without picking vLLM vs llama.cpp vs ik_llama from first principlesUsers who already have a 3090 and a modern 27B-class model in mind (Qwen3.6-27B, Gemma 4 31B, Qwen3.6 35B-A3B, Gemma 4 26B-A4B) and want the maintainer's measured TPS numbers + prefill-cliff diagnosis to pick the right config up frontTeams who want a reproducible benchmark + soak-test pipeline (`bash scripts/rebench-full.sh`) that produces per-model TPS reports under `results/rebench/<tag>/` so cross-rig comparisons are apples-to-applesUsers who want a lazydocker-style cockpit (`c3`) to monitor live GPU + scene + container state during a long-running inference sessionEngineers adopting the `beellama.cpp` community fork on single-card 24GB Ampere because mainline vLLM is blocked on the head_dim=512 FA wall (the README publishes a measured single-card workaround table)Users who want the universal `pull` flow to evaluate any safetensors HF repo with an honest one-line fit verdict (`--recommend`) instead of guessing quantization + VRAM headroomWindows users who want a documented WSL2 setup path (`docs/WSL_SETUP.md` is start-to-finish) before deploying the Docker stackCross-rig sharing where a triage report generator (`report.sh`) captures hardware + stack + boot-log highlights as redacted markdown safe to paste into a public issue without leaking home paths or HF tokensAdopters who want the Image Studio bundle (`docs/ai-studio/image.md`) to run Ideogram-4 image gen + a chat model + Open WebUI together on two GPUs with one command (`bash scripts/setup-image-studio.sh`)

Who should skip it

Skip noonghunna/club-3090 unless the captured evidence suggests it solves a problem you are actively working on.

About this signal

noonghunna/club-3090 is tracked by RepoRadar as a rtx 3090/4090/5090 community llm in the Apache-2.0 community recipes for serving LLMs on section. It was first seen on 2026-06-25 and last updated on 2026-06-25. The current verdict is 'try now' with a Gold tier and moderate setup difficulty. The standout signals for noonghunna/club-3090 are workflow potential (9.6) and maturity (9.1), while setup ease (6.4) trails — that balance shapes where it fits best. This page summarizes the evidence RepoRadar has captured from captured source metadata. The score, tier, risk label, and verdict on this page are never influenced by sponsorship, ads, or tips — they reflect only the usefulness, popularity, novelty, momentum, maturity, and evidence signals described in the RepoRadar methodology.

How this item is evaluated

RepoRadar assigned noonghunna/club-3090 a composite score of 8.5 out of 10, placing it in the Gold tier. This score combines weighted sub-signals: usefulness (35%), novelty (18%), momentum (14%), maturity (10%), open-source/build quality (7%), evidence quality (6%), workflow potential (6%), and setup ease (4%). Popularity is tracked separately at 1468.0 and never affects the composite score or tier. The risk label of 'low' reflects inherent user-impacting hazards, not generic novelty. Items with no risk flag may still require normal code review before production use.

Risk explanation

**Linux + Docker-first stack; Windows requires WSL2.** The repo's primary deployment target is Linux + Docker compose; the README is explicit that native Windows runs only the upstream llama.cpp binary — none of the repo's tooling. Windows users must follow `docs/WSL_SETUP.md` start-to-finish before the Docker steps. macOS is documented but most measured TPS numbers come from Linux + Ampere/Ada/Blackwell NVIDIA hardware; Apple Silicon users should not expect cross-rig reproducibility with the published numbers; **Ampere 24GB prefill-cliff wall on single-card vLLM at >50K context.** The README is explicit that on 24GB single-card vLLM, prefill OOMs at >~50K single-prompt context because of the head_dim=512 FA wall (Genesis v7.72.2 PN59 was intended as the fix but does not engage on chunked-prefill). Workarounds are measured and documented: `vllm/dual` with TP=2 escapes it, or `llamacpp/default` (different engine, no cliff). Verify the team's context-length requirements against the published Cliff table (`docs/CLIFFS.md`) before adopting single-card vLLM for long-context workloads; **Community fork (`beellama.cpp`) is unofficial and unvalidated on sm_89/sm_120.** For Gemma 4 31B on Ampere 24GB single-card, the README ships the `beellama` config with the explicit warning that the `beellama.cpp` community fork builds with `FA_ALL_QUANTS=ON` and the multi-arch image is unvalidated on sm_89/120 — users adopting this path should pin the commit and re-run the published benchmarks on their own hardware, or prefer `ik-llama/iq4ks-mtp` if the single-card code-fast profile is not critical.

Evidence links

Closest alternatives / related signals

club-3090noonghunna309040905090rtx-3090rtx-4090rtx-5090