Skill · quality 0.57

mcp-local-rag

Search, ingest, expand chunk context, or manage local documents via a local RAG MCP server (tools: query_documents, read_chunk_neighbors, ingest_file, ingest_data, delete_file, list_files). Use when user says "search my docs", "save this page", "read around that chunk", "what did

Price: free
Protocol: skill
Verified: no

What it does

MCP Local RAG Skills

Tools

MCP Tool | CLI Equivalent | Use When
ingest_file | npx mcp-local-rag ingest <path> | Local files (PDF, DOCX, TXT, MD). CLI for bulk/directory.
ingest_data | - | Raw content (HTML, text) with source URL
query_documents | npx mcp-local-rag query <text> | Semantic + keyword hybrid search
delete_file | npx mcp-local-rag delete <path> | Remove ingested content
list_files | npx mcp-local-rag list | File ingestion status
status | npx mcp-local-rag status | Database stats
read_chunk_neighbors | npx mcp-local-rag read-neighbors | Read N chunks adjacent to a known chunkIndex (context expansion; call after query_documents or grep)

Search: Core Rules

Hybrid search combines vector (semantic) and keyword (BM25).

Score Interpretation

Lower = better match. Use this to filter noise.

Score | Action
< 0.3 | Use directly
0.3-0.5 | Include if mentions same concept/entity
0.5-0.7 | Include only if directly relevant to the question
> 0.7 | Skip unless no better results
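
The thresholds above can be sketched as a small lookup. This is an illustration only, not part of the skill: the function name `score_action` is hypothetical, and the treatment of exact boundary values (0.3, 0.5, 0.7) is an assumption, since the table does not say which bucket they fall in.

```python
def score_action(score: float) -> str:
    """Map a lower-is-better score to the action listed in the table.

    Boundary handling is an assumption: 0.3 and 0.5 go to the
    higher (stricter) bucket, 0.7 stays in the 0.5-0.7 bucket.
    """
    if score < 0.3:
        return "use directly"
    if score < 0.5:
        return "include if same concept/entity"
    if score <= 0.7:
        return "include only if directly relevant"
    return "skip unless no better results"
```

A caller would still apply the judgment the table asks for; the function only names the bucket a score falls in.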

Limit Selection

Intent | Limit
Specific answer (function, error) | 5
General understanding | 10
Comprehensive survey | 20

Query Formulation

Situation | Why Transform | Action
Specific term mentioned | Keyword search needs exact match | KEEP term
Vague query | Vector search needs semantic signal | ADD context
Error stack or code block | Long text dilutes relevance | EXTRACT core keywords
Multiple distinct topics | Single query conflates results | SPLIT queries
Few/poor results | Term mismatch | EXPAND (see below)

Query Expansion

When results are few or all score > 0.5, expand query terms:

  • Keep original term first, add 2-4 variants
  • Types: synonyms, abbreviations, related terms, word forms
  • Example: "config" → "config configuration settings configure"

Avoid over-expansion (causes topic drift).
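
The expansion rule might be sketched as follows. Both function names are hypothetical, and `min_results=3` is an assumed reading of "few"; the skill does not give a number. The cap of four variants follows the "2-4 variants" guidance and also guards against over-expansion.

```python
def needs_expansion(scores: list[float], min_results: int = 3) -> bool:
    """True when results are few or every score is above 0.5.

    min_results is an assumption; the skill only says "few".
    """
    return len(scores) < min_results or all(s > 0.5 for s in scores)


def expand_query(term: str, variants: list[str], max_variants: int = 4) -> str:
    """Keep the original term first, then append up to max_variants unique variants."""
    seen = {term.lower()}
    kept: list[str] = []
    for variant in variants:
        key = variant.lower()
        if key in seen:
            continue  # drop duplicates of the term or of earlier variants
        seen.add(key)
        kept.append(variant)
        if len(kept) == max_variants:
            break
    return " ".join([term, *kept])
```

For instance, `expand_query("config", ["configuration", "settings", "configure"])` reproduces the example string given in the list above.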

Result Selection

Decide whether to include or skip a result based on answer quality, not just score.

INCLUDE if:

  • Directly answers the question
  • Provides necessary context
  • Score < 0.5

SKIP if:

  • Same keyword, unrelated context
  • Score > 0.7
  • Mentions term without explanation

fileTitle

Each result includes fileTitle (document title extracted from content). Null when extraction fails.

Use | How
Disambiguate chunks | Use fileTitle to identify which document the chunk belongs to
Group related chunks | Same fileTitle = same document context
Deprioritize mismatches | fileTitle unrelated to query AND score > 0.5 → rank lower
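
As a rough sketch of the deprioritization rule, the snippet below reorders results so that title mismatches with a score above 0.5 sink to the bottom. The name `rerank` and the word-overlap test for "unrelated" are assumptions made for illustration; in practice relatedness is a judgment call. It also assumes each result is a dict with `fileTitle` and `score` keys, and treats a null `fileTitle` as no evidence either way.

```python
def rerank(results: list[dict], query: str) -> list[dict]:
    """Order by score (lower first), pushing mismatched titles with score > 0.5 last."""
    query_words = set(query.lower().split())

    def title_mismatch(result: dict) -> bool:
        title = result.get("fileTitle")
        if not title:
            return False  # extraction failed: do not penalize
        # Crude stand-in for "unrelated": no word shared with the query.
        return not (query_words & set(title.lower().split()))

    return sorted(
        results,
        key=lambda r: (title_mismatch(r) and r["score"] > 0.5, r["score"]),
    )
```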

Context Expansion (read_chunk_neighbors)

read_chunk_neighbors (CLI: read-neighbors) is an on-demand context expansion utility, not a routine follow-up to every query_documents call. Chunks in this index are semantic units — sentences or paragraphs grouped by topic via Max-Min semantic chunking, not fixed-size text slices. Reading the chunks immediately before and after a target chunk yields coherent surrounding context, not arbitrary fragments.

Each query_documents result item already includes filePath and chunkIndex. Pass those to read_chunk_neighbors to expand a specific hit in place.

Trigger this tool only when one of these signals is present:

  • Insufficient context for your answer: during response generation, the target chunk alone is not enough to reach a grounded conclusion (e.g., it references "this approach" or "as shown above" without the referent).
  • Explicit user request for more context: the user asks for surrounding detail ("what comes before that?", "read more around that section", "show me the full explanation").

If neither signal is present, stop at the query_documents results.

Typical workflow when triggered:

  1. Identify the specific chunk to expand (from a prior query_documents hit or grep).
  2. Take that chunk's filePath and chunkIndex.
  3. Call read_chunk_neighbors with those values; the response contains the target chunk plus its semantic neighbors, sorted by chunkIndex.

See cli-reference.md for output fields and an example.
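
The trigger-then-expand workflow above could be sketched like this. `call_tool` is a hypothetical stand-in for whatever function your MCP client uses to invoke a tool, and the argument names `filePath` and `chunkIndex` are assumed to match the fields returned by query_documents; check the tool schema before relying on them.

```python
from typing import Any, Callable, Optional


def expand_hit(
    hit: dict,
    call_tool: Callable[[str, dict], Any],
    *,
    chunk_is_insufficient: bool = False,
    user_asked_for_context: bool = False,
) -> Optional[Any]:
    """Call read_chunk_neighbors only when one of the two signals is present."""
    if not (chunk_is_insufficient or user_asked_for_context):
        return None  # neither signal: stop at the query_documents results
    args = {"filePath": hit["filePath"], "chunkIndex": hit["chunkIndex"]}
    return call_tool("read_chunk_neighbors", args)
```

Passing the client call in as a parameter keeps the decision logic separate from the transport, which makes the gate easy to test without a running server.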

Ingestion

ingest_file

ingest_file({ filePath: "/absolute/path/to/document.pdf" })

ingest_data

ingest_data({
  content: "<html>...</html>",
  metadata: { source: "https://example.com/page", format: "html" }
})

Format selection — match the data you have:

  • HTML string → format: "html"
  • Markdown string → format: "markdown"
  • Other → format: "text"

Source format:

  • Web page → Use URL: https://example.com/page
  • Other content → Use scheme: {type}://{date} or {type}://{date}/{detail} where {type} is a short identifier for the content origin (e.g., clipboard, chat, note, meeting)
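
A small helper can assemble the ingest_data payload from the rules above. The helper names are hypothetical, and rendering `{date}` as an ISO `YYYY-MM-DD` string is an assumption, since the skill does not fix a date format. The payload shape mirrors the ingest_data example shown earlier.

```python
from datetime import date
from typing import Optional

ALLOWED_FORMATS = {"html", "markdown", "text"}


def make_source(kind: str, day: date, detail: Optional[str] = None) -> str:
    """Build a {type}://{date} or {type}://{date}/{detail} source string."""
    base = f"{kind}://{day.isoformat()}"  # ISO date is an assumed convention
    return f"{base}/{detail}" if detail else base


def build_ingest_payload(content: str, source: str, fmt: str) -> dict:
    """Return the argument object for ingest_data."""
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"format must be one of {sorted(ALLOWED_FORMATS)}")
    return {"content": content, "metadata": {"source": source, "format": fmt}}
```

For a web page, pass the page URL as `source` directly; `make_source` is only for content without a URL.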

HTML source options:

  • Static page → HTTP fetch
  • SPA/JS-rendered → Browser/web tool with DOM rendering
  • Auth required → Manual paste

If HTTP fetch returns empty or minimal content, retry with a browser/web tool.

Source URLs are normalized: query strings and fragments are stripped. See html-ingestion.md for cases where this matters.
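
To see what stripping query strings and fragments implies, here is an approximation using Python's standard library. It is not the server's actual code, and the server may normalize further; `normalize_source` is a hypothetical name.

```python
from urllib.parse import urlsplit, urlunsplit


def normalize_source(url: str) -> str:
    """Drop the query string and fragment, keeping scheme, host, and path."""
    parts = urlsplit(url)
    return urlunsplit((parts.scheme, parts.netloc, parts.path, "", ""))
```

Under this rule, two URLs that differ only in their query string collapse to the same source, so ingesting the second would update the first rather than create a new entry.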

Re-ingest same source to update. Use same source in delete_file to remove.

CLI commands

CLI subcommands mirror MCP tools. Useful for bulk operations, scripting, and environments without MCP.

  • query, list, status, delete output JSON to stdout
  • ingest outputs progress to stderr
  • Use --help on any command for options
  • See cli-reference.md for options and config matching
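
For scripting, the query subcommand can be driven from Python, as in the hedged sketch below. It assumes `npx` is on the PATH and uses only the `query <text>` form shown in the tools table. The function names are hypothetical, and the shape of the returned JSON is not specified here, so the result is returned unparsed beyond `json.loads`.

```python
import json
import subprocess


def build_query_command(text: str) -> list[str]:
    """Argument list for `npx mcp-local-rag query <text>`."""
    return ["npx", "mcp-local-rag", "query", text]


def run_query(text: str):
    """Run the CLI and parse the JSON it writes to stdout.

    Raises CalledProcessError on a non-zero exit code.
    """
    proc = subprocess.run(
        build_query_command(text),
        capture_output=True,
        text=True,
        check=True,
    )
    return json.loads(proc.stdout)
```

Passing the command as a list avoids shell quoting problems when the query text contains spaces or quotes.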

References

For edge cases and examples, see the reference files named above: cli-reference.md and html-ingestion.md.

Capabilities

skill · source-shinpr · skill-mcp-local-rag · topic-agent-skills · topic-developer-tools · topic-hybrid-search · topic-local-first · topic-local-rag · topic-mcp · topic-mcp-server · topic-privacy-first · topic-rag · topic-semantic-search · topic-skills · topic-vector-search

Install

Install: npx skills add shinpr/mcp-local-rag
Transport: skills-sh
Protocol: skill

Quality

0.57 / 1.00

Deterministic score 0.57 from registry signals: indexed on github topic:agent-skills · 242 github stars · SKILL.md body (6,198 chars)

Provenance

Indexed from: github
Enriched: 2026-05-02 18:54:12Z · deterministic:skill-github:v1 · v1
First seen: 2026-04-18
Last seen: 2026-05-02
