Item detail

yaroshevych/desktopctl

yaroshevych/desktopctl is an MIT-licensed local desktop control stack that gives AI agents OCR-backed screen observation plus mouse, keyboard, explicit waits, and post-action verification through a CLI and daemon split instead of fragile one-off coordinate scripts.

Score8.3
Popularity6.7
Riskhigh
TierSilver
Score breakdown
Usefulness8.0
Novelty8.0
Momentum7.0
Maturity6.1
Open-source/build8.4
Evidence7.2
Workflow potential9.8
Setup ease4.2

Popularity is tracked separately. Support, ads, sponsorships, and tips never affect these signals.

Why it matters

Useful for advanced builders testing local computer-use workflows who want a reproducible command surface for desktop actions without sending screenshots to a hosted service.

Who should use it

developers building local computer-use agentsautomation tinkerers who want selector-first desktop actionsteams evaluating local-first alternatives to hosted browser-control stacksadvanced users prototyping screen-driven assistants on macOS

Who should skip it

Skip or sandbox it if you cannot review permissions, data access, and failure modes before use.

Risk explanation

It can observe the screen and drive mouse and keyboard actions on a live machine, so any attached agent needs strict scope control and a non-sensitive test environment.

Evidence links

Closest alternatives / related signals

computer-usedesktop-automationocrlocal-aiagent-tools