Skillquality 0.45
Benchmark virtual agents with scripted multi-turn conversations using Agent Evaluation
Run concurrent scripted conversations against a target agent to measure whether it stays on task, responds correctly, and holds up in repeatable test cases.
Price
free
Protocol
skill
Verified
no
What it does
Benchmark virtual agents with scripted multi-turn conversations using Agent Evaluation
Run concurrent scripted conversations against a target agent to measure whether it stays on task, responds correctly, and holds up in repeatable test cases.
Prerequisites
Python environment, target agent endpoint or integration, optional AWS services such as Bedrock or SageMaker
Installation
No source-backed install or usage instructions could be extracted automatically. Review the upstream project before running this skill in a sensitive workflow.
Documentation
Source
Capabilities
skillsource-agentskillexchangeskill-benchmark-virtual-agents-with-scripted-multi-turn-conversations-using-agent-evaluationtopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog
Install
Installnpx skills add agentskillexchange/skills
Transportskills-sh
Protocolskill
Quality
0.45/ 1.00
deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (836 chars)
Provenance
Indexed fromgithub
Enriched2026-05-18 19:09:37Z · deterministic:skill-github:v1 · v1
First seen2026-05-18
Last seen2026-05-18