Skillquality 0.45

Benchmark browser agents on repeatable Playwright web tasks with Bananalyzer

Run a repeatable evaluation suite for browser agents against static web task snapshots instead of judging them from demos or one-off tests.

Price
free
Protocol
skill
Verified
no

What it does

Benchmark browser agents on repeatable Playwright web tasks with Bananalyzer

Run a repeatable evaluation suite for browser agents against static web task snapshots instead of judging them from demos or one-off tests.

Prerequisites

Python environment, Playwright browser runtime, pytest-based test execution, a custom AgentRunner implementation, example web task snapshots

Installation

Requirements and caveats from upstream:

  • <img alt="Python" src="https://img.shields.io/badge/python-3670A0?style=for-the-badge&logo=python&logoColor=ffdd54" />
  • individual website. For an agent to best generalize, we require building a diverse dataset of websites across
  • In the future we will support more complex evaluation methods and examples that require multiple steps to complete. The

Basic usage or getting-started notes:

Documentation

Source

Capabilities

skillsource-agentskillexchangeskill-benchmark-browser-agents-on-repeatable-playwright-web-tasks-with-bananalyzertopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,490 chars)

Provenance

Indexed fromgithub
Enriched2026-05-18 19:09:36Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18

Agent access