50x faster than
Claude Computer Use

Local HTTP server that gives any AI agent eyes and hands on Windows. Screen capture, mouse, keyboard, OCR, and UI Automation. All on localhost, all through a simple REST API.

View on GitHub
Download eyehands.exe — extract, run, done.
7-day free trial, all features included. No Python required.

Measured, not claimed

Head-to-head on the same Windows machine, April 2026

OperationeyehandsClaude Computer Use
Screenshot57ms2-4 seconds~50x
5-action batch67ms3-5 seconds~50x
Screenshot resolution1920x10801456x8161.7x
Text input89ms native SendInputClipboard paste
OCR text searchBuilt-inNot availableUnique
UI AutomationBuilt-inNot availableUnique

What you get

Eyes

Screen capture at ~20fps into a rolling buffer. On-demand screenshots in 57ms. BetterCam DXGI backend. Multi-monitor. Per-Monitor DPI v2. Region and window-specific capture.

Hands

Mouse and keyboard via Windows SendInput. Goes through the Raw Input pipeline, so games, Parsec, and pointer-lock apps see the input correctly. Absolute and relative movement. Batch up to 100 actions in one call.

Understanding

OCR text search via EasyOCR with frame-level caching. Windows UI Automation tree walking. Find and click elements by name instead of pixel coordinates.

Speed

Everything on localhost. No cloud round-trips. Frame-hash ETag polling for zero-token screen watching. Cached OCR returns instantly on unchanged frames.

Building Godot games? (save $20)

Works with everything

Any AI agent that can make HTTP calls. Python SDK, MCP server, OpenAI function calling, or raw curl.

Claude Code OpenAI / GPT Cursor Local LLMs Any HTTP client

One price. No subscription.

$49

One-time payment. 7-day free trial first.

  • All endpoints, no restrictions
  • Screen capture, mouse, keyboard
  • OCR text search + UI Automation
  • Batch actions, smooth movement
  • Python SDK + MCP server
  • Per-machine activation
  • Lifetime updates

Use cases