Skill · quality 0.45

Benchmark browser agents on a fixed stealth and task suite with browser-use benchmark

Compare browser-agent reliability on a repeatable task and anti-bot suite before choosing a stack or claiming progress.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/benchmark-browser-agents-on-a-fixed-stealth-and-task-suite-with-browser-use-benchmark

What it does

Compare browser-agent reliability on a repeatable task and anti-bot suite before choosing a stack or claiming progress.

Prerequisites

Python, uv, the benchmark repository's dependencies, API keys for the judge model and the selected browser provider, and a configuration for the target browser agent

Installation

Use the upstream install or setup path that matches your environment:

pip install uv
uv sync
uv run python run_eval.py --browser <provider>
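
To compare several providers in one pass, the run command above can be wrapped in a small script. The following is a minimal sketch, not part of the upstream repository: it assumes only the `--browser <provider>` flag shown above, and the helper names (`build_eval_command`, `run_all`) and any provider names passed in are hypothetical placeholders.

```python
import shlex
import subprocess


def build_eval_command(provider: str) -> list[str]:
    """Build the argv for one run of the upstream command shown above."""
    # Reject empty names and anything that would be parsed as another flag.
    if not provider or provider.startswith("-"):
        raise ValueError(f"invalid provider name: {provider!r}")
    return ["uv", "run", "python", "run_eval.py", "--browser", provider]


def run_all(providers: list[str], dry_run: bool = True) -> list[list[str]]:
    """Print each command and, unless dry_run is set, execute it in order.

    Provider names are supplied by the caller; this sketch does not know
    which values the benchmark actually accepts.
    """
    commands = [build_eval_command(p) for p in providers]
    for argv in commands:
        print("command:", shlex.join(argv))
        if not dry_run:
            # check=True stops the loop at the first failing run.
            subprocess.run(argv, check=True)
    return commands
```

Calling `run_all(["<provider-a>", "<provider-b>"])` from the benchmark checkout prints the commands without running them; pass `dry_run=False` once the names match providers the benchmark supports.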

Requirements and caveats from upstream:

python -c "

Basic usage or getting-started notes:

2. Set up your .env (see .env.example)
cp .env.example .env
4. Run the evaluation
Source: https://github.com/browser-use/benchmark
Extracted from upstream docs: https://raw.githubusercontent.com/browser-use/benchmark/HEAD/README.md
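
Before running the evaluation, it can help to confirm that the copied `.env` actually has values filled in. The sketch below is an illustration, not an upstream tool: it assumes `.env.example` and `.env` use plain `KEY=VALUE` lines and that every key named in `.env.example` is required, which the upstream docs may not guarantee. The function names are hypothetical.

```python
from pathlib import Path


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        if line.startswith("export "):
            line = line[len("export "):]
        key, _, value = line.partition("=")
        # Drop surrounding quotes so KEY="" counts as empty.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(example_text: str, env_text: str) -> list[str]:
    """Return keys named in the example that are absent or empty in .env."""
    wanted = parse_env(example_text)
    have = parse_env(env_text)
    return [key for key in wanted if not have.get(key)]


def check_env(directory: str = ".") -> list[str]:
    """Compare .env against .env.example in the given directory."""
    root = Path(directory)
    return missing_keys(
        (root / ".env.example").read_text(encoding="utf-8"),
        (root / ".env").read_text(encoding="utf-8"),
    )
```

Running `check_env()` from the benchmark checkout returns the keys that still need a value; an empty list means every key from the example file is set.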

Documentation

https://github.com/browser-use/benchmark#readme

Source

Agent Skill Exchange

Capabilities

skill · source-agentskillexchange · skill-benchmark-browser-agents-on-a-fixed-stealth-and-task-suite-with-browser-use-benchmark · topic-agent-skills · topic-ai-agents · topic-ai-tools · topic-awesome-list · topic-claude-code · topic-codex · topic-cursor · topic-llm · topic-mcp · topic-npx-skills · topic-openclaw · topic-skills-catalog

Install

Install: npx skills add agentskillexchange/skills

Source: https://github.com/agentskillexchange/skills/tree/main/skills/benchmark-browser-agents-on-a-fixed-stealth-and-task-suite-with-browser-use-benchmark

skills.sh: https://skills.sh/agentskillexchange/skills/benchmark-browser-agents-on-a-fixed-stealth-and-task-suite-with-browser-use-benchmark

Transport: skills-sh

Protocol: skill

Quality

0.45 / 1.00

Deterministic score 0.45 from registry signals: indexed on GitHub topic:agent-skills · 8 GitHub stars · SKILL.md body (1,135 chars)

Provenance

Indexed from: github

Enriched: 2026-05-18 19:09:36Z · deterministic:skill-github:v1 · v1

First seen: 2026-05-18

Last seen: 2026-05-18

Agent access

JSON: https://clawmart.sh/api/listings/DYg4Yc