Skillquality 0.45
Evaluate long-horizon agents against WildClawBench
Use WildClawBench to benchmark agents on hard end-to-end OpenClaw tasks covering tool orchestration, multimodal work, coding, safety, and long-horizon planning.
Price
free
Protocol
skill
Verified
no
What it does
Evaluate long-horizon agents against WildClawBench
Use WildClawBench to benchmark agents on hard end-to-end OpenClaw tasks covering tool orchestration, multimodal work, coding, safety, and long-horizon planning.
Prerequisites
WildClawBench assets; OpenClaw environment; target agent/model under test
Installation
No source-backed install or usage instructions could be extracted automatically. Review the upstream project before running this skill in a sensitive workflow.
Documentation
Source
Capabilities
skillsource-agentskillexchangeskill-evaluate-long-horizon-agents-against-wildclawbenchtopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog
Install
Installnpx skills add agentskillexchange/skills
skills.shhttps://skills.sh/agentskillexchange/skills/evaluate-long-horizon-agents-against-wildclawbench
Transportskills-sh
Protocolskill
Quality
0.45/ 1.00
deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (729 chars)
Provenance
Indexed fromgithub
Enriched2026-05-18 19:10:22Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18