Browser Use
Open-source Python library that wraps a Playwright-controlled browser into an agent loop driven by any of 15+ LLM providers, with a paid stealth-browser cloud as the production tier.
Description
Browser Use exposes three primary objects (Agent, Browser and a chat client) that together let a language model navigate, click, type, scroll and extract data from real web pages. On each step the agent extracts an annotated DOM, optionally adds a screenshot for vision-capable models, and has the LLM emit structured actions that the underlying Playwright browser executes. The library is MIT-licensed and integrates with OpenAI, Anthropic, Google Gemini, Azure, Bedrock, Groq, Ollama, DeepSeek, OpenRouter and others. A paid managed cloud at api.browser-use.com adds stealth Chromium, residential proxies in 195+ countries and CAPTCHA solving on top of the open-source library.
Solution
Per-step Playwright-backed loop. On each turn the Agent snapshots the page (annotated DOM plus optional screenshot), the LLM emits up to max_actions_per_step structured actions, the Controller executes them through Playwright, and the results feed back into the next observation. The loop runs under agent.run() until the task completes or max_steps is reached.
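The control flow described above can be sketched without the library itself. The block below is a toy simulation, not Browser Use's implementation: FakePage, Action, Observation, scripted_llm and run_loop are hypothetical names invented for illustration, the "page" is an in-memory stub rather than a Playwright browser, and the "LLM" is a scripted function. Only max_steps and max_actions_per_step come from the description.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Action:
    """Hypothetical structured action, e.g. click / type / done."""
    name: str
    args: dict = field(default_factory=dict)


@dataclass
class Observation:
    """Hypothetical per-step snapshot handed to the model."""
    dom: str                                  # annotated DOM as text
    screenshot: Optional[bytes] = None        # optional, for vision models
    last_results: List[str] = field(default_factory=list)


class FakePage:
    """In-memory stand-in for a Playwright-controlled page (assumption)."""

    def __init__(self) -> None:
        self.log: List[str] = []

    def snapshot(self) -> str:
        return f"[0]<button>Submit</button> (actions so far: {len(self.log)})"

    def execute(self, action: Action) -> str:
        self.log.append(action.name)
        return f"ok:{action.name}"


def run_loop(
    page: FakePage,
    llm: Callable[[Observation], List[Action]],
    max_steps: int = 10,
    max_actions_per_step: int = 3,
) -> dict:
    """Observe, ask the model for actions, execute them, feed results back."""
    results: List[str] = []
    for step in range(1, max_steps + 1):
        obs = Observation(dom=page.snapshot(), last_results=results)
        # Cap the number of actions taken in a single step.
        actions = llm(obs)[:max_actions_per_step]
        results = []
        for action in actions:
            if action.name == "done":
                return {"done": True, "steps": step}
            results.append(page.execute(action))
    return {"done": False, "steps": max_steps}


def scripted_llm(obs: Observation) -> List[Action]:
    """Stand-in for a real model: finish once the text has been typed."""
    if "ok:type" in obs.last_results:
        return [Action("done")]
    return [Action("click", {"index": 0}), Action("type", {"text": "hi"})]


if __name__ == "__main__":
    page = FakePage()
    print(run_loop(page, scripted_llm), page.log)
```

In this sketch the step counter and the per-step action cap are the only stopping and throttling mechanisms; a real agent would also need error handling and a way to represent failed actions in the next observation.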
Primary use cases
- natural-language web automation and form filling
- agent-driven data extraction from authenticated sites
- browser testing and monitoring via LLM-written tasks
- production scraping behind a stealth cloud with residential proxies