Item detail

promptfoo/promptfoo

promptfoo is an MIT-licensed CLI and library for evaluating prompts, agents, and RAG systems, with declarative test configs, provider comparisons, red-team checks, and CI-friendly workflows for catching quality and safety regressions before release.

Score8.7
Popularity80.0
Riskconditional
TierGold
Score breakdown
Usefulness9.0
Novelty8.0
Momentum8.0
Maturity8.4
Open-source/build8.4
Evidence7.2
Workflow potential9.8
Setup ease6.4

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for teams that need a practical way to test AI behavior repeatedly instead of relying on one-off manual prompt checks.

Who should use it

LLM app teamsAI product engineerssecurity-minded buildersdevelopers maintaining RAG or agent workflows

Who should skip it

Skip if the source link, docs, or setup requirements do not match your workflow.

Risk explanation

Evaluation runs can forward prompts, datasets, and model outputs to external providers, so scrub sensitive test data before using hosted model backends..

Evidence links

Closest alternatives / related signals

llm-evalsred-teamingprompt-testingragdeveloper-tools