v1.0.1

Browser Automation

peytoncasper peytoncasper ← All skills

Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.

Downloads
5.5k
Stars
6
Versions
2
Updated
2026-02-23

Install

npx clawhub@latest install browser-automation

Documentation

Browser Automation

Automate browser interactions using Stagehand CLI with Claude.

First: Environment Selection (Local vs Remote)

The skill automatically selects between local and remote browser environments:

  • -If Browserbase API keys exist (BROWSERBASE_API_KEY and BROWSERBASE_PROJECT_ID in .env file): Uses remote Browserbase environment
  • -If no Browserbase API keys: Falls back to local Chrome browser
  • -No user prompting: The selection happens automatically based on available configuration

Setup (First Time Only)

Check setup.json in this directory. If setupComplete: false:

npm install    # Install dependencies

npm link # Create global 'browser' command

Commands

All commands work identically in both modes:

browser navigate <url>                    # Go to URL

browser act "<action>" # Natural language action

browser extract "<instruction>" ['{}'] # Extract data (optional schema)

browser observe "<query>" # Discover elements

browser screenshot # Take screenshot

browser close # Close browser

Quick Example

browser navigate https://example.com

browser act "click the Sign In button"

browser extract "get the page title"

browser close

Mode Comparison

| Feature | Local | Browserbase |

|---------|-------|-------------|

| Speed | Faster | Slightly slower |

| Setup | Chrome required | API key required |

| Stealth mode | No | Yes |

| Proxy/CAPTCHA | No | Yes |

| Best for | Development | Production/scraping |

Best Practices

1. Always navigate first before interacting

2. View screenshots after each command to verify

3. Be specific in action descriptions

4. Close browser when done

Troubleshooting

  • -Chrome not found: Install Chrome or use Browserbase mode
  • -Action fails: Use browser observe to discover available elements
  • -Browserbase fails: Verify API key and project ID are set

For detailed examples, see [EXAMPLES.md](EXAMPLES.md).

For API reference, see [REFERENCE.md](REFERENCE.md).

Launch an agent with Browser Automation on Termo.