Framework · Browser & Computer-Use

Skyvern

Automate browser-based workflows with AI using vision LLMs instead of brittle selectors.

Description

Playwright-compatible browser automation that combines vision LLMs with computer vision to interact with websites without per-site XPath or CSS selectors, so it adapts to layout changes and works on sites it has never seen. It ships a no-code workflow builder alongside the SDK. The project reports strong results on the WebVoyager benchmark and on form-filling tasks.

Solution

A vision LLM observes the rendered page, decides the next interaction, and acts through a Playwright-compatible layer. It repeats this perceive→decide→act loop without relying on code-defined selectors.
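
The loop can be sketched in a few lines of Python. This is an illustrative toy under stated assumptions, not Skyvern's API: FakePage, Action, stub_decide and run_agent are hypothetical names, the page is an in-memory stand-in for a browser, and a rule-based function stands in for the vision-LLM call. Note that actions name elements by their visible label rather than by a selector.

```python
from dataclasses import dataclass, field


@dataclass
class Action:
    kind: str          # "fill", "click", or "done"
    target: str = ""   # visible label of the element, not a CSS/XPath selector
    value: str = ""


@dataclass
class FakePage:
    """Hypothetical stand-in for a rendered page (no real browser involved)."""
    fields: dict = field(default_factory=lambda: {"Email": "", "Name": ""})
    submitted: bool = False

    def observe(self) -> dict:
        # Perceive: a real agent would capture a screenshot of the page here.
        return {"fields": dict(self.fields), "submitted": self.submitted}

    def apply(self, action: Action) -> None:
        # Act: a real agent would drive the browser here.
        if action.kind == "fill":
            self.fields[action.target] = action.value
        elif action.kind == "click" and action.target == "Submit":
            self.submitted = True


def stub_decide(observation: dict, goal: dict) -> Action:
    """Placeholder for the vision-LLM call: choose the next action from what is visible."""
    if observation["submitted"]:
        return Action("done")
    for label, current in observation["fields"].items():
        if label in goal and current != goal[label]:
            return Action("fill", label, goal[label])
    return Action("click", "Submit")


def run_agent(page: FakePage, goal: dict, decide=stub_decide, max_steps: int = 10) -> list:
    """Repeat perceive -> decide -> act until the decider says it is done."""
    history = []
    for _ in range(max_steps):
        observation = page.observe()          # perceive
        action = decide(observation, goal)    # decide
        if action.kind == "done":
            break
        page.apply(action)                    # act
        history.append(action)
    return history


page = FakePage()
steps = run_agent(page, {"Email": "a@example.com", "Name": "Ada"})
```

The max_steps cap is a design choice for the sketch: since the decider is a model rather than a fixed script, bounding the loop keeps a confused agent from running indefinitely.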

Primary use cases

  • automating browser workflows across unfamiliar or changing sites
  • form-filling and data-entry automation without per-site selectors
  • no-code browser automation for non-technical users
