Skill · quality 0.45

Convert dense PDFs into LLM-ready text and page-aligned markdown with olmOCR

Use olmOCR when an agent needs to turn scanned or layout-heavy documents into clean markdown or text before chunking, search, extraction, or citation workflows.

Price

free

Protocol

skill

Verified

Endpoint

https://skills.sh/agentskillexchange/skills/convert-dense-pdfs-into-llm-ready-text-and-page-aligned-markdown-with-olmocr

Prerequisites

Python 3.11, pip or conda, poppler-utils, and an NVIDIA GPU if you run inference locally (the model is a 7B-parameter VLM)
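
The prerequisites above can be checked before installing. The following is a minimal sketch, not part of olmOCR: the function name check_prerequisites is hypothetical, pdftoppm is used as a stand-in for "poppler-utils is installed" because it is one of the binaries that package ships, and the presence of nvidia-smi is only a rough signal that an NVIDIA driver is available.

```python
import shutil
import sys


def check_prerequisites(version=None, which=shutil.which):
    """Return a list of human-readable problems; an empty list means all checks passed.

    `version` and `which` are injectable so the check can be exercised without
    touching the real environment (hypothetical helper, not an olmOCR API).
    """
    version = tuple(version or sys.version_info[:2])
    problems = []
    # The listing names Python 3.11 specifically.
    if version != (3, 11):
        problems.append(f"Python 3.11 expected, found {version[0]}.{version[1]}")
    # pdftoppm is installed by poppler-utils.
    if which("pdftoppm") is None:
        problems.append("pdftoppm not found; install poppler-utils")
    # nvidia-smi ships with the NVIDIA driver; absence suggests no usable local GPU.
    if which("nvidia-smi") is None:
        problems.append("nvidia-smi not found; local GPU inference is unavailable")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("WARNING:", problem)
```

Run it inside the environment you plan to install into; a missing nvidia-smi only matters if you intend to run inference on the local machine.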

Installation

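Use the upstream install path that matches your environment. The plain package is the base install; the gpu extra adds what local GPU inference needs and pulls PyTorch wheels from the cu128 index:

conda create -n olmocr python=3.11
conda activate olmocr
pip install olmocr
pip install "olmocr[gpu]" --extra-index-url https://download.pytorch.org/whl/cu128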

Requirements and caveats from upstream:

olmOCR is based on a 7B-parameter VLM, so it requires a GPU.
June 17, 2025 - v0.1.75 - Switched from an sglang-based to a vllm-based inference pipeline; updated the Docker image to CUDA 12.8.
May 23, 2025 - v0.1.70 - Official Docker support and images are now available; see the Docker usage section of the upstream README.

Basic usage or getting-started notes:

System Dependencies
You will need to install poppler-utils and additional fonts for rendering PDF images.
Source: https://github.com/allenai/olmocr
Extracted from upstream docs: https://raw.githubusercontent.com/allenai/olmocr/HEAD/README.md
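
This listing does not include the conversion command itself. As a hedged sketch, the helper below assembles an invocation of the olmOCR pipeline module. The module name olmocr.pipeline, the positional workspace argument, and the --markdown and --pdfs flags are recalled from the upstream README and are assumptions here, not something this listing states; confirm them with `python -m olmocr.pipeline --help` for your installed version. The function names build_olmocr_command and run_olmocr are hypothetical.

```python
import subprocess
import sys
from pathlib import Path


def build_olmocr_command(workspace, pdfs, markdown=True):
    """Build the argv list for the olmOCR pipeline CLI.

    Flag names are assumptions taken from the upstream README; verify them
    against `python -m olmocr.pipeline --help` before relying on this.
    """
    if not pdfs:
        raise ValueError("at least one PDF path is required")
    cmd = [sys.executable, "-m", "olmocr.pipeline", str(workspace)]
    if markdown:
        cmd.append("--markdown")  # assumed flag: also write markdown output
    cmd.append("--pdfs")
    cmd.extend(str(Path(p)) for p in pdfs)
    return cmd


def run_olmocr(workspace, pdfs, markdown=True):
    """Run the pipeline and raise CalledProcessError if it exits non-zero."""
    return subprocess.run(build_olmocr_command(workspace, pdfs, markdown), check=True)
```

Calling run_olmocr("./localworkspace", ["report.pdf"]) would launch the pipeline with the current interpreter, so it uses whichever environment olmocr was installed into. Building the argument list separately keeps the command easy to log or inspect before anything runs.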

Documentation

https://github.com/allenai/olmocr#readme

Source

Agent Skill Exchange

Capabilities

skill, source-agentskillexchange, skill-convert-dense-pdfs-into-llm-ready-text-and-page-aligned-markdown-with-olmocr, topic-agent-skills, topic-ai-agents, topic-ai-tools, topic-awesome-list, topic-claude-code, topic-codex, topic-cursor, topic-llm, topic-mcp, topic-npx-skills, topic-openclaw, topic-skills-catalog

Install

Install: npx skills add agentskillexchange/skills

Source: https://github.com/agentskillexchange/skills/tree/main/skills/convert-dense-pdfs-into-llm-ready-text-and-page-aligned-markdown-with-olmocr

skills.sh: https://skills.sh/agentskillexchange/skills/convert-dense-pdfs-into-llm-ready-text-and-page-aligned-markdown-with-olmocr

Transport: skills-sh

Protocol: skill

Quality

0.45 / 1.00

Deterministic score 0.45 from registry signals: indexed on GitHub topic:agent-skills · 8 GitHub stars · SKILL.md body (1,438 chars)

Provenance

Indexed from: GitHub

Enriched: 2026-05-18 19:09:56Z · deterministic:skill-github:v1 · v1

First seen: 2026-05-18

Last seen: 2026-05-18

Agent access

JSON: https://clawmart.sh/api/listings/Lw4hQu
