{"id":"cb24deea-5d27-403d-8883-0caf9c6f58e2","shortId":"8wLjat","kind":"skill","title":"OCRmyPDF Searchable PDF OCR Pipeline","tagline":"OCRmyPDF is an open source tool that adds a searchable OCR text layer to scanned PDFs. It is useful when an agent needs to turn image-based documents into text-searchable files without rebuilding a full document pipeline.","description":"# OCRmyPDF Searchable PDF OCR Pipeline\n\nOCRmyPDF is an open source tool that adds a searchable OCR text layer to scanned PDFs. It is useful when an agent needs to turn image-based documents into text-searchable files without rebuilding a full document pipeline.\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- brew install tesseract-lang\n\nRequirements and caveats from upstream:\n- [pyversions]: https://img.shields.io/pypi/pyversions/ocrmypdf \"Supported Python versions\"\n- Linux, Windows, macOS and FreeBSD are supported. Docker images are also available, for both x64 and ARM.\n- # Add an OCR layer and require PDF/A\n\nBasic usage or getting-started notes:\n- [![Build Status](https://github.com/ocrmypdf/OCRmyPDF/actions/workflows/build.yml/badge.svg)](https://github.com/ocrmypdf/OCRmyPDF/actions/workflows/build.yml) [![PyPI version][pypi]](https://pypi.org/project/ocrmypdf...\n- | Operating system | Install command |\n- | ----------------------------- | ------------------------------|\n\n- Source: https://github.com/ocrmypdf/OCRmyPDF\n- Extracted from upstream docs: https://raw.githubusercontent.com/ocrmypdf/OCRmyPDF/HEAD/README.md\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/ocrmypdf-searchable-pdf-ocr-pipeline/)","tags":["ocrmypdf","searchable","pdf","ocr","pipeline","skills","agentskillexchange","agent-skills","ai-agents","ai-tools","awesome-list","claude-code"],"capabilities":["skill","source-agentskillexchange","skill-ocrmypdf-searchable-pdf-ocr-pipeline","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/ocrmypdf-searchable-pdf-ocr-pipeline","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,290 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:11:31.391Z","embedding":null,"createdAt":"2026-05-18T13:18:03.662Z","updatedAt":"2026-05-18T19:11:31.391Z","lastSeenAt":"2026-05-18T19:11:31.391Z","tsv":"'/ocrmypdf/ocrmypdf':169 '/ocrmypdf/ocrmypdf/actions/workflows/build.yml/badge.svg)](https://github.com/ocrmypdf/ocrmypdf/actions/workflows/build.yml)':155 '/ocrmypdf/ocrmypdf/head/readme.md':176 '/project/ocrmypdf...':161 '/pypi/pyversions/ocrmypdf':116 '/skills/ocrmypdf-searchable-pdf-ocr-pipeline/)':183 'add':13,58,137 'agent':27,72,178 'agentskillexchange.com':182 'agentskillexchange.com/skills/ocrmypdf-searchable-pdf-ocr-pipeline/)':181 'also':130 'arm':136 'avail':131 'base':33,78 'basic':144 'brew':103 'build':151 'caveat':110 'command':165 'doc':173 'docker':127 'document':34,44,79,89 'environ':102 'exchang':180 'extract':170 'file':39,84 'freebsd':124 'full':43,88 'get':148 'getting-start':147 'github.com':154,168 'github.com/ocrmypdf/ocrmypdf':167 'github.com/ocrmypdf/ocrmypdf/actions/workflows/build.yml/badge.svg)](https://github.com/ocrmypdf/ocrmypdf/actions/workflows/build.yml)':153 'imag':32,77,128 'image-bas':31,76 'img.shields.io':115 'img.shields.io/pypi/pyversions/ocrmypdf':114 'instal':91,95,104,164 'lang':107 'layer':18,63,140 'linux':120 'maco':122 'match':100 'need':28,73 'note':150 'ocr':4,16,49,61,139 'ocrmypdf':1,6,46,51 'open':9,54 'oper':162 'path':98 'pdf':3,48 'pdf/a':143 'pdfs':21,66 'pipelin':5,45,50,90 'pypi':156,158 'pypi.org':160 'pypi.org/project/ocrmypdf...':159 'python':118 'pyvers':113 'raw.githubusercontent.com':175 'raw.githubusercontent.com/ocrmypdf/ocrmypdf/head/readme.md':174 'rebuild':41,86 'requir':108,142 'scan':20,65 'searchabl':2,15,38,47,60,83 'setup':97 'skill':179 'skill-ocrmypdf-searchable-pdf-ocr-pipeline' 'sourc':10,55,166,177 'source-agentskillexchange' 'start':149 'status':152 'support':117,126 'system':163 'tesseract':106 'tesseract-lang':105 'text':17,37,62,82 'text-search':36,81 'tool':11,56 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'turn':30,75 'upstream':94,112,172 'usag':145 'use':24,69,92 'version':119,157 'window':121 'without':40,85 'x64':134","prices":[{"id":"96686415-6022-4e88-8908-1e731aa5cdc7","listingId":"cb24deea-5d27-403d-8883-0caf9c6f58e2","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:18:03.662Z"}],"sources":[{"listingId":"cb24deea-5d27-403d-8883-0caf9c6f58e2","source":"github","sourceId":"agentskillexchange/skills/ocrmypdf-searchable-pdf-ocr-pipeline","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/ocrmypdf-searchable-pdf-ocr-pipeline","isPrimary":false,"firstSeenAt":"2026-05-18T13:18:03.662Z","lastSeenAt":"2026-05-18T19:11:31.391Z"}],"details":{"listingId":"cb24deea-5d27-403d-8883-0caf9c6f58e2","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"ocrmypdf-searchable-pdf-ocr-pipeline","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"91da3d55f29e095bf2304711dd6e585575e59cc9","skill_md_path":"skills/ocrmypdf-searchable-pdf-ocr-pipeline/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/ocrmypdf-searchable-pdf-ocr-pipeline"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"OCRmyPDF Searchable PDF OCR Pipeline","description":"OCRmyPDF is an open source tool that adds a searchable OCR text layer to scanned PDFs. It is useful when an agent needs to turn image-based documents into text-searchable files without rebuilding a full document pipeline."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/ocrmypdf-searchable-pdf-ocr-pipeline"},"updatedAt":"2026-05-18T19:11:31.391Z"}}