Skillquality 0.45

Benchmark deep research agents across factual, quality, and process dimensions with MiroEval

Score deep research agents on benchmark tasks using factual verification, report-quality scoring, and process evaluation before model or workflow changes ship.

Price
free
Protocol
skill
Verified
no

What it does

Benchmark deep research agents across factual, quality, and process dimensions with MiroEval

Score deep research agents on benchmark tasks using factual verification, report-quality scoring, and process evaluation before model or workflow changes ship.

Prerequisites

Python, uv, model result JSON, required API keys for judge and retrieval services

Installation

No source-backed install or usage instructions could be extracted automatically. Review the upstream project before running this skill in a sensitive workflow.

Documentation

Source

Capabilities

skillsource-agentskillexchangeskill-benchmark-deep-research-agents-across-factual-quality-and-process-dimensions-with-miroevaltopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (812 chars)

Provenance

Indexed fromgithub
Enriched2026-05-18 19:09:36Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18

Agent access

Benchmark deep research agents across factual, quality, and process dimensions with MiroEval — Clawmart · Clawmart