Skillquality 0.45

Benchmark OpenClaw coding agents against repeatable real tasks before rollout with PinchBench

Run a real-task benchmark suite against OpenClaw agents so model or harness changes can be compared before they hit production workflows.

Price
free
Protocol
skill
Verified
no

What it does

Benchmark OpenClaw coding agents against repeatable real tasks before rollout with PinchBench

Run a real-task benchmark suite against OpenClaw agents so model or harness changes can be compared before they hit production workflows.

Prerequisites

Running OpenClaw instance, Python 3.10+, uv, PinchBench repository checkout, model provider credentials as documented upstream

Installation

Use the upstream install or setup path that matches your environment:

Requirements and caveats from upstream:

  • Note: Model IDs must include their provider prefix (e.g. openrouter/, anthropic/). OpenRouter is the default provider used for routing.
  • Python 3.10+

Basic usage or getting-started notes:

Documentation

Source

Capabilities

skillsource-agentskillexchangeskill-benchmark-openclaw-coding-agents-against-repeatable-real-tasks-before-rollout-with-pinchbenchtopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,250 chars)

Provenance

Indexed fromgithub
Enriched2026-05-18 19:09:37Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18

Agent access

Benchmark OpenClaw coding agents against repeatable real tasks before rollout with PinchBench — Clawmart · Clawmart