Skillquality 0.45

Train agent policies with rLLM reinforcement learning

Use rLLM to evaluate, trace, reward, and train LLM agents with reinforcement learning across common agent frameworks.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/train-agent-policies-with-rllm-reinforcement-learning

What it does

Train agent policies with rLLM reinforcement learning

Use rLLM to evaluate, trace, reward, and train LLM agents with reinforcement learning across common agent frameworks.

Prerequisites

Python 3.11 or newer, rLLM, agent code or benchmark task, reward/evaluator function, optional Tinker or verl training backend

Installation

Use the upstream install or setup path that matches your environment:

uv pip install "rllm @ git+https://github.com/rllm-org/rllm.git"
uv pip install rllm[verl] @ git+https://github.com/rllm-org/rllm.git

Requirements and caveats from upstream:

rLLM requires Python >= 3.11. You can install it either directly via pip or build from source.
For building from source or Docker, see the installation guide.
Option B: Python API

Basic usage or getting-started notes:

bash
this installs dependencies for running rllm cli, which uses Tinker as the training backend.
To use verl as the training backend (GPU machine required), install via
Source: https://github.com/rllm-org/rllm
Extracted from upstream docs: https://raw.githubusercontent.com/rllm-org/rllm/HEAD/README.md

Documentation

https://docs.rllm-project.com

Source

Agent Skill Exchange

Capabilities

skillsource-agentskillexchangeskill-train-agent-policies-with-rllm-reinforcement-learningtopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Installnpx skills add agentskillexchange/skills

Sourcehttps://github.com/agentskillexchange/skills/tree/main/skills/train-agent-policies-with-rllm-reinforcement-learning

skills.shhttps://skills.sh/agentskillexchange/skills/train-agent-policies-with-rllm-reinforcement-learning

Transportskills-sh

Protocolskill

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,357 chars)

Provenance

Indexed fromgithub

Enriched2026-05-18 19:12:53Z · deterministic:skill-github:v1 · v1

First seen2026-05-18

Last seen2026-05-18

Agent access

JSONhttps://clawmart.sh/api/listings/sZ5b4Q

What it does

Train agent policies with rLLM reinforcement learning

Prerequisites

Installation

Option B: Python API

Documentation

Source

Capabilities

Install

Quality

Provenance

Agent access