MCPquality 0.57

Token Compressor

Semantic prompt compression using local LLM rewriting with embedding validation to reduce token usage by 40-60%.

Price
free
Protocol
mcp
Verified
no

What it does

Semantic prompt compression using local LLM rewriting with embedding validation to reduce token usage by 40-60%.

Token Compressor is a two-stage pipeline that compresses prompts before they reach an LLM. The first stage uses a local model (llama3.2:1b via Ollama) to rewrite prompts to their semantic minimum while preserving all conditionals and negations. The second stage validates compression quality by computing cosine similarity between original and compressed embeddings using nomic-embed-text, falling back to the original if similarity drops below a configurable threshold.

The system requires Ollama running locally and exposes a compress_prompt tool. It achieves 40-60% token reduction across English and Swedish prompts while maintaining semantic fidelity through the embedding validation gate.

Capabilities

mcptransport-stdioopen-source

Server

Quality

0.57/ 1.00

deterministic score 0.57 from registry signals: · indexed on pulsemcp · has source repo · 8 github stars · registry-generated description present

Provenance

Indexed frompulsemcp
Enriched2026-05-02 07:22:08Z · deterministic:mcp:v1 · v1
First seen2026-04-18
Last seen2026-05-02

Agent access