
browser-act cli
β 3,500from browser-act
Help your AI agent finish more browser tasks.
Browser automation CLI built for AI agents. Get past anti-bot walls, hand off to humans across platforms when stuck, run parallel tasks without cross-contamination, and isolate multiple accounts in independent browsers.
Why BrowserAct
The browser an AI agent needs has to reach places standard tools can't, let a human seamlessly take over when the agent is stuck, keep parallel tasks from cross-contaminating, and be designed for LLM reasoning β not human-written scripts. A browser for agents must get four things right.
1. Break through blocks β three progressive layers
- Environment layer β stealth fingerprint spoofing, TLS rotation, proxy switching. The vast majority of blocks never trigger.
- Execution layer β
solve-captchaauto-solves CAPTCHAs;stealth-extractpulls protected pages in one command. - Human layer β
remote-assistgenerates a live URL; the user takes over from any device, and the agent continues seamlessly when done.
2. Three browser modes β by real-world scenario
| Mode | Scenario | Key trait |
|---|---|---|
chrome | Reuse local Chrome login state | Profile import or CDP attach |
stealth privacy mode | Frictionless batch scraping without login | Fresh fingerprint per session + proxy rotation, zero residue |
stealth fixed identity | Logged-in accounts Β· multi-browser parallel | Stable fingerprint + stable IP, stable account identity, not flagged as bots |
3. Zero-interference concurrency β every agent in its own lane
- Cross-browser parallel β independent cookies, fingerprints, proxies. Sites cannot correlate them.
- Same-browser multi-session β shared login state, independent execution, tasks don't block each other.
- Privacy mode β fresh fingerprint and empty profile per session, zero residue when done.
4. Designed for agent reasoning β not human scripts
- Compact text output β indexed text format, several times more token-efficient than JSON or HTML.
- Indexed interaction β
statereturns an indexed list;click 3/input 2 "...". No DOM parsing required. - Semantic memory β every browser carries a
desc, matched to tasks by meaning. - Concurrency-safe β session ownership + explicit naming. Multi-agent operation never conflicts.
Security: confirmation gating β sensitive operations (browser create / delete, Profile import, proxy changes, security and privacy toggles) require explicit user approval. Prior approvals do not carry over. Enforced at the Skill layer, not a configuration toggle.
And More
- Better headless β Default headless without disrupting users; stealth headless that isn't detected.
- Cross-platform remote handoff β Any device opens the link to take over, and the agent continues seamlessly.
Compatibility
OS: Windows, macOS, Linux
Agents: Claude Code Β· Cursor Β· VS Code Β· OpenCode Β· OpenClaw Β· Codex Β· Gemini CLI β works with any agent that can execute shell commands and load Skills.
What's Free
Almost everything is free. Only two features require payment: managed proxies (Dynamic / Static), and stealth browsers beyond the first 5.
| Feature | Free<br>(NoΒ Signup) | Free<br>(LoginΒ Only) | Paid |
|---|---|---|---|
| Browser automation, Chrome / Chrome-direct | β | β | β |
| Stealth browser (β€ 5), stealth-extract, solve-captcha, remote-assist, privacy mode, Skill Forge | β | β | β |
| Stealth browser (> 5), Dynamic / Static proxy | β | β | β |
Documentation
Full documentation covers anti-blocking, browser modes, sessions and concurrency, headless and remote handoff, agent design, the Skills system, and the complete command reference.
Read the full documentation β
Also From BrowserAct
Skill Forge β Your Personal Scraping Engineer
Need to extract data from the same website repeatedly at scale? Don't write scrapers by hand. Skill Forge explores a site once, discovers its APIs and data patterns, generates a deploy-ready Skill package, then runs reliably without re-exploration β 500 or 5,000 records through the same stable path.
Any website. Any data. One command to start:
Install browser-act-skill-forge. Skill source: https://github.com/browser-act/skills/tree/main/browser-act-skill-forge . Verify it works after installation.
Then tell your agent what you need:
"Forge a Skill that extracts job listings from LinkedIn β title, company, salary, URL. I'll run 300 keywords later."
Solutions Catalog
30+ pre-built Skills already generated by Skill Forge, ready to install and run. Covers Amazon, Google Maps, YouTube, Reddit, WeChat, Zhihu, and more.
Browse the full Solutions Catalog β
Build Your Own
Can't find what you need above? Generate a custom Skill for any website in minutes β no coding required. Just describe what data you want or what action to perform, and Skill Forge handles the rest.
π Support the Project
BrowserAct Skills is free and open source. If it saves you time, please give us a β Star β it keeps the project alive and helps us ship more skills.
<a href="https://github.com/browser-act/skills/stargazers"> <img src="https://img.shields.io/github/stars/browser-act/skills?style=social" alt="GitHub Stars"> </a>π Bonus: Once you star the repository, you can join our Discord and post in the #claim-500-credits channel to receive 500 free credits!
π€ Community & Support
- π¬ Join our Discord
- π Read the Docs
- π Report an Issue
- π BrowserAct Website
Star History
<a href="https://www.star-history.com/?repos=browser-act%2Fskills&type=date&logscale=&legend=top-left"> <picture> <source media="(prefers-color-scheme: dark)" srcset="https://api.star-history.com/chart?repos=browser-act/skills&type=date&theme=dark&legend=top-left" /> <source media="(prefers-color-scheme: light)" srcset="https://api.star-history.com/chart?repos=browser-act/skills&type=date&legend=top-left" /> <img alt="Star History Chart" src="https://api.star-history.com/chart?repos=browser-act/skills&type=date&legend=top-left" /> </picture> </a>Install
Tell your AI agent:
Install browser-act. Skill source: https://github.com/browser-act/skills/tree/main/browser-act . Verify it works after installation.
Quick Start
# Extract protected page content (zero config)
browser-act stealth-extract https://example.com
# Full browser automation
browser-act --session my-task browser open <id> https://example.com
browser-act --session my-task state # See clickable elements
browser-act --session my-task click 3 # Click by index
browser-act --session my-task input 2 "hi" # Type into a fieldMore examples and workflows β
The agent runs get-skills at the start of each session β gets environment state, browser list, and commands in one call:
browser-act get-skills core --skill-version 2.0.2How agents discover and use BrowserAct β
No common issues documented yet. If you hit a problem, the repository's GitHub Issues page is the best place to look.