Score breakdown
Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.
Why it matters
Useful for researchers, podcasters, journalists, and product teams who need speaker-attributed transcripts from multi-speaker audio (interviews, meetings, podcasts, call-center recordings) where standard ASR produces a flat text stream without speaker labels.
Who should use it
Who should skip it
Skip if the source link, docs, or setup requirements do not match your workflow.
Risk explanation
250 stars and pushed 2026-06-04 — research-track, not a production-hardened SaaS; benchmark on your own audio before depending on it; Pretrained checkpoints cover English + Mandarin; other languages require fine-tuning on labeled data.