All articles
GuidesFebruary 19, 2026· 10 min read
Eight ways to build a browser agent
A tour of the patterns teams use to put a real browser behind an LLM.
PN
Priya N.
Developer Relations
There is no single right way to build a browser agent. Here are eight patterns we see in the wild, from the simplest to the most ambitious.
- Prompt-to-action: let the model emit clicks and types directly
- Tool-calling: expose navigation and extraction as functions
- DOM-to-JSON: hand the model a cleaned page and ask for data
- Vision-first: screenshot, reason, act
- Record-and-replay: capture a flow once, parameterize it
- Skill-based: call named, versioned tasks
- Human-in-the-loop: pause for approval on risky steps
- Fleet: run many of the above in parallel
Pick by failure mode
Choose the pattern that fails in a way you can tolerate. A crawl can afford to skip a page; a checkout flow cannot. Ferr gives you the same browser under all eight.
Build it on Ferr
Launch your first cloud browser for free.
Keep reading
All articlesGuidesApril 23, 2026
Building a durable research agent on Ferr
A walkthrough of an agent that researches for hours, survives restarts, and never loses its place.
Read article8 min read
GuidesMarch 26, 2026
A beginner's guide to Ferr
From zero to your first cloud browser in a few minutes, with the code to copy.
Read article6 min read
GuidesFebruary 5, 2026
Give Claude Code a real browser
Add Ferr as a tool so your coding agent can open pages, click, and read the live web.
Read article6 min read