Skillquality 0.45

llama.cpp Portable LLM Inference Engine in C/C++

llama.cpp is a high-performance C/C++ implementation for running LLM inference across diverse hardware. It supports GGUF model quantization, GPU acceleration on NVIDIA/AMD/Apple Silicon, and provides both a CLI and an OpenAI-compatible HTTP server for local model serving.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/llama-cpp-portable-llm-inference

What it does

llama.cpp Portable LLM Inference Engine in C/C++

Installation

Use the upstream install or setup path that matches your environment:

Run with Docker - see our Docker documentation

Requirements and caveats from upstream:

Basic usage or getting-started notes:

Install llama.cpp using brew, nix or winget
Download pre-built binaries from the releases page
Build from source by cloning this repository - check out our build guide
Source: https://github.com/ggml-org/llama.cpp
Extracted from upstream docs: https://raw.githubusercontent.com/ggml-org/llama.cpp/HEAD/README.md

Source

Agent Skill Exchange

Capabilities

skillsource-agentskillexchangeskill-llama-cpp-portable-llm-inferencetopic-agent-skillstopic-ai-agentstopic-ai-toolstopic-awesome-listtopic-claude-codetopic-codextopic-cursortopic-llmtopic-mcptopic-npx-skillstopic-openclawtopic-skills-catalog

Install

Installnpx skills add agentskillexchange/skills

Sourcehttps://github.com/agentskillexchange/skills/tree/main/skills/llama-cpp-portable-llm-inference

skills.shhttps://skills.sh/agentskillexchange/skills/llama-cpp-portable-llm-inference

Transportskills-sh

Protocolskill

Quality

0.45/ 1.00

deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,307 chars)

Provenance

Indexed fromgithub

Enriched2026-05-18 19:11:12Z · deterministic:skill-github:v1 · v1

First seen2026-05-18

Last seen2026-05-18

Agent access

JSONhttps://clawmart.sh/api/listings/jazmHT