Item detail

run-llama/ParseBench

ParseBench is a document-parsing benchmark built around roughly 2,000 human-verified enterprise pages and five failure dimensions that matter when agents need reliable structured output from PDFs.

Score8.4
Popularity70.0
Risknone
TierGold
Score breakdown
Usefulness8.0
Novelty7.0
Momentum8.0
Maturity8.0
Open-source/build8.4
Evidence7.2
Workflow potential8.8
Setup ease6.4

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for teams choosing or validating a parsing stack before they wire it into RAG, extraction, or document automation pipelines.

Who should use it

RAG buildersdocument AI teamsdevelopers evaluating OCR and parsing vendorsresearchers comparing structured extraction tools

Who should skip it

Skip if the source link, docs, or setup requirements do not match your workflow.

Risk explanation

No inherent user-impacting risk is flagged from the captured evidence.

Evidence links

Closest alternatives / related signals

document-parsingbenchmarkpdfragevaluation