Skyvern
Automate browser-based workflows with AI using vision LLMs instead of brittle selectors.
Description
Playwright-compatible browser automation that uses vision LLMs plus computer vision to interact with websites without per-site XPath/CSS selectors, adapting to layout changes and sites it has never seen. Ships a no-code workflow builder alongside the SDK; reports strong WebVoyager and form-filling results.
Solution
A vision-LLM observes the rendered page, decides the next interaction, and acts via a Playwright-compatible layer — looping perceive→decide→act without relying on code-defined selectors.
Primary use cases
- automating browser workflows across unfamiliar or changing sites
- form-filling and data-entry automation without per-site selectors
- no-code browser automation for non-technical users
Open the full interactive page →
Diagram, neighbourhood map, code examples, related patterns and full provenance.