January 22, 2026 • Doppio.ai Team

New Skill: Browser Use - AI-Powered Browser Automation

browser

automation

web

scraping

cli

We’ve added a new development skill to the directory: Browser Use from the browser-use team. This tool enables AI agents to autonomously interact with websites through a combination of CLI commands and visual understanding.

What Browser Use Does

Browser Use bridges the gap between language models and web browsers. It gives AI agents the ability to “see” web pages through screenshots and clickable element detection, then interact with them just like a human would.

CLI Commands

The command-line interface provides direct browser control:

browser-use open <url> - Navigate to any website
browser-use state - List all interactive elements with indices
browser-use click [index] - Click on specific elements
browser-use type - Input text into forms
browser-use screenshot - Capture the current view
browser-use sessions - Manage multiple parallel browsing sessions

Browser Modes

Browser Use supports multiple modes for different use cases:

Headless (Default)

Fast, invisible browser execution for automation tasks where you don’t need to see what’s happening.

Headed

Visible browser window so you can watch the agent work. Useful for debugging and demonstrations.

Real Chrome

Uses your existing Chrome installation with saved logins and cookies. Perfect for tasks requiring authentication you’ve already set up.

Cloud

Stealth browser with built-in proxies and anti-detection features. Handles fingerprinting and CAPTCHA challenges for production workloads.

Key Features

Vision-Based Interaction

The agent receives screenshots and a map of clickable elements, enabling visual understanding of page layouts and dynamic content.

Parallel Sessions

Manage multiple browser sessions simultaneously for tasks like comparison shopping, multi-account workflows, or parallel data collection.

Custom Tools

Extend agent capabilities with custom tools using Python decorators for domain-specific automation.

Use Cases

Browser Use is useful for:

Form automation - Fill complex applications and multi-step forms
E-commerce tasks - Add items to carts, compare prices across sites
Research - Browse and aggregate information from multiple sources
Web scraping - Extract data from dynamic, JavaScript-heavy pages
Testing - Automate user flows and verify functionality

Get Started

Browse the Browser Use skill in the directory. For installation instructions and detailed documentation, visit the GitHub repository.

The latest release (0.11.4) includes the CLI and companion skill specifically designed for Claude Code integration.

Featured Skills

Browser Use

AI-powered browser automation with CLI and skill. Control browsers via commands, capture screenshots, click elements, and manage parallel sessions in headless, headed, or cloud modes.

developmentby browser-use