Item detail

HolmesGPT/holmesgpt

HolmesGPT/holmesgpt is an Apache-2.0-licensed, CNCF Sandbox open-source AI agent for investigating production incidents and finding root causes across Kubernetes, VMs, cloud providers, databases, and SaaS platforms, with a new Operator Mode that runs 24/7 in the background, spots problems before customers notice, and messages the on-call in Slack with a fix that can open PRs through the GitHub int

Score8.5
Popularity8.7
Risklow
TierGold
Score breakdown
Usefulness9.0
Novelty8.0
Momentum9.0
Maturity6.8
Open-source/build8.4
Evidence7.2
Workflow potential10.0
Setup ease6.4

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for SRE teams, platform engineers, and on-call rotation leads who need an Apache-2.0-licensed, CNCF Sandbox open-source AI agent for investigating production incidents and finding root causes across Kubernetes, VMs, cloud providers, databases, and SaaS platforms, with a new Operator Mode that runs 24/7 in the background, spots problems before customers notice, and messages the on-call in Sl

Who should use it

SRE teams who need an Apache-2.0-licensed, CNCF Sandbox open-source AI agent that investigates production incidents and finds root causes across Kubernetes, VMs, cloud providers, databases, and SaaS platformsplatform engineers who want a 24/7 Operator Mode that runs in the background, spots problems before customers notice, and messages the on-call in Slack with a fix that can open PRs through the GitHub integrationon-call rotation leads who need a CNCF-grade incident-investigation agent that scales to petabyte-scale observability data with server-side filtering and memory-safe executionopen-source contributors who want an Apache-2.0-licensed alternative to closed-source on-call copilots

Who should skip it

Skip if the source link, docs, or setup requirements do not match your workflow.

Risk explanation

It is an Apache-2.0-licensed CNCF Sandbox AI agent that investigates production incidents and can open PRs through the GitHub integration, so review which data sources the agent is allowed to read, scope which clusters the agent is deployed in, confirm that Operator Mode runs in dry-run before granting it real cluster permissions, and gate any production rollout behind a security review before allowing the agent to open PRs against production repositories.

Evidence links

Closest alternatives / related signals

sreincident-responseai-agentkubernetesroot-cause-analysisoperator-modecncfopen-source