Forensic failures
Near-miss candidates, console errors, and a screenshot path — enough to repair in one agent turn.
Local browser automation for MCP clients
Reflex runs locally in Claude Desktop, Cursor, Claude Code, or any MCP client. Batch a whole flow into about 2 guarded calls per task (6 across three measured flows vs 14 with Playwright MCP), send ~13K tokens into context instead of ~56K, and never let a page leave your machine.
For AI assistants with browser tools: research products, fill forms, check prices, scrape tables, and compare options without burning your context on page dumps.
No card. 30 free credits (~~10-15 tasks). 2 minutes to first task.
Wikipedia: search an article, open its revision history
Drop Reflex into your client in one click.
One click in your client, then reflex_auth to sign in.
Getting started
Benchmarks
Numbers from our published head-to-head, run 2026-06-12 against @playwright/mcp 0.0.76 on live sites, no LLM on either side. Every number, and how to reproduce it, is on the benchmarks page.
| Playwright MCP | Reflex | |
|---|---|---|
| Tool calls, 3 measured flows | 14 | 6 |
| Tokens into context, same flows | ~56K | ~13K |
| Flows completed, of 6 attempted | 3 | 5 |
| One look at a Wikipedia article | ~31K tokens | ~4K tokens |
| One look at the worst page tested (W3C CSS spec) | ~169K tokens | ~7K tokens |
| Re-reading an unchanged page | ~16K tokens | 91 characters: "(page unchanged)" |
Measured 2026-06-12. See every number and how to reproduce it.
Pure MCP clients like Claude Desktop pay the inline snapshot price above on every look. Reflex distills the page instead, so your agent keeps its context window for the actual work. Playwright MCP is free and better for iframes, network mocking, and Firefox/WebKit; Reflex wins when your client pays per snapshot in context. Full methodology and the reproduce command are on the benchmarks page.
Agent-native
Near-miss candidates, console errors, and a screenshot path — enough to repair in one agent turn.
Expectations checked inside a single guarded call, so a flow lands or fails loudly without extra round trips.
Saved flows re-run with no model in the loop and self-heal when the page shifts.
Watch the real run
Privacy
Reflex runs entirely on your computer: the MCP server, the browser, your logins, every page you visit. The only things that ever cross the network are your API key and a count of tool calls, so we can meter credits. We could not read your pages, sessions, or passwords if we wanted to; the architecture has no path for them.
Compare that to cloud browser products, where every page you visit renders on someone else's server.
No path exists for your pages, logins, or sessions to reach us.
flows completed on the same 6 live-site tasks (first-try success)
per page look vs Playwright MCP, averaged across 12 live sites
to re-run a saved flow (deterministic replay, no model in the loop)
Pricing
For occasional flows.
Best value for steady use.
For heavy, daily automation.
FAQ
Ready when you are
Installed in 2 minutes. Your pages stay yours.