Skillquality 0.45

Grade agent trajectories and tool-use decisions with AgentEvals

Score whether an agent took a sensible intermediate path, called tools correctly, and reached the outcome without relying only on final-answer checks.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/grade-agent-trajectories-and-tool-use-decisions-with-agentevals

What it does

Grade agent trajectories and tool-use decisions with AgentEvals

Score whether an agent took a sensible intermediate path, called tools correctly, and reached the outcome without relying only on final-answer checks.

Prerequisites

Python or TypeScript runtime, agent run outputs or trajectories, optional LLM judge provider

Installation

Use the upstream install or setup path that matches your environment:

pip install agentevals
npm install agentevals @langchain/core
pip install openai
npm install openai

Requirements and caveats from upstream:

<summary>Python</summary>
python
Python Async Support

Basic usage or getting-started notes:

To get started, install agentevals:
<details open>
bash
Source: https://github.com/langchain-ai/agentevals
Extracted from upstream docs: https://raw.githubusercontent.com/langchain-ai/agentevals/HEAD/README.md

Documentation

https://github.com/langchain-ai/agentevals

Source

Agent Skill Exchange

Capabilities

skillsource-agentskillexchangeskill-grade-agent-trajectories-and-tool-use-decisions-with-agentevalstopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Installnpx skills add agentskillexchange/skills

Sourcehttps://github.com/agentskillexchange/skills/tree/main/skills/grade-agent-trajectories-and-tool-use-decisions-with-agentevals

skills.shhttps://skills.sh/agentskillexchange/skills/grade-agent-trajectories-and-tool-use-decisions-with-agentevals

Transportskills-sh

Protocolskill

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,116 chars)

Provenance

Indexed fromgithub

Enriched2026-05-18 19:10:44Z · deterministic:skill-github:v1 · v1

First seen2026-05-18

Last seen2026-05-18

Agent access

JSONhttps://clawmart.sh/api/listings/mV6K8p