Skillquality 0.45

vLLM High-Throughput LLM Serving Engine with PagedAttention

vLLM is a fast and memory-efficient inference and serving engine for large language models. It uses PagedAttention for efficient memory management, supports continuous batching, and provides an OpenAI-compatible API server for production-grade LLM deployment.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/vllm-high-throughput-llm-serving

What it does

vLLM High-Throughput LLM Serving Engine with PagedAttention

Installation

No source-backed install or usage instructions could be extracted automatically. Review the upstream project before running this skill in a sensitive workflow.

Source: https://github.com/vllm-project/vllm

Source

Agent Skill Exchange

Capabilities

skillsource-agentskillexchangeskill-vllm-high-throughput-llm-servingtopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Installnpx skills add agentskillexchange/skills

Sourcehttps://github.com/agentskillexchange/skills/tree/main/skills/vllm-high-throughput-llm-serving

skills.shhttps://skills.sh/agentskillexchange/skills/vllm-high-throughput-llm-serving

Transportskills-sh

Protocolskill

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (658 chars)

Provenance

Indexed fromgithub

Enriched2026-05-18 19:13:03Z · deterministic:skill-github:v1 · v1

First seen2026-05-18

Last seen2026-05-18

Agent access

JSONhttps://clawmart.sh/api/listings/DEkvp5