{"id":"3207c657-4c7a-4b4a-be61-ad60dca60b75","shortId":"uRh7Le","kind":"skill","title":"PaddleOCR Multilingual Document OCR and Structured Data Toolkit","tagline":"PaddleOCR is a powerful, lightweight OCR toolkit developed by Baidu that converts documents and images into structured, AI-friendly data like JSON and Markdown. It supports 100+ languages with industry-leading accuracy, bridging the gap between images/PDFs and LLMs.","description":"# PaddleOCR Multilingual Document OCR and Structured Data Toolkit\n\nPaddleOCR is a powerful, lightweight OCR toolkit developed by Baidu that converts documents and images into structured, AI-friendly data like JSON and Markdown. It supports 100+ languages with industry-leading accuracy, bridging the gap between images/PDFs and LLMs.\n\n## Installation\n\nRequirements and caveats from upstream:\n- ![python](https://img.shields.io/badge/python-3.8~3.12-aff.svg)\n- **Comprehensive upgrade of the PP-OCRv5 C++ local deployment solution, now supporting both Linux and Windows, with feature parity and identical accuracy to the Python implementation.**\n- **The high-stability service-oriented deployment solution is now fully open-sourced, allowing users to customize Docker images and SDKs as required.**\n\nBasic usage or getting-started notes:\n- **Documentation has been updated to include key metrics for commonly used configurations on mainstream hardware, such as inference latency and memory usage, providing deployment references for users.**\n- ## 🚀 Quick Start\n- For local usage, please refer to the following documentation based on your needs:\n\n- Source: https://github.com/PaddlePaddle/PaddleOCR\n- Extracted from upstream docs: https://raw.githubusercontent.com/PaddlePaddle/PaddleOCR/HEAD/README.md\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/paddleocr-multilingual-document-ocr-toolkit/)","tags":["paddleocr","multilingual","document","ocr","toolkit","skills","agentskillexchange","agent-skills","ai-agents","ai-tools","awesome-list","claude-code"],"capabilities":["skill","source-agentskillexchange","skill-paddleocr-multilingual-document-ocr-toolkit","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/paddleocr-multilingual-document-ocr-toolkit","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,422 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:11:38.837Z","embedding":null,"createdAt":"2026-05-18T13:18:13.984Z","updatedAt":"2026-05-18T19:11:38.837Z","lastSeenAt":"2026-05-18T19:11:38.837Z","tsv":"'/badge/python-3.8~3.12-aff.svg)':108 '/paddlepaddle/paddleocr':213 '/paddlepaddle/paddleocr/head/readme.md':220 '/skills/paddleocr-multilingual-document-ocr-toolkit/)':227 '100':36,85 'accuraci':42,91,131 'agent':222 'agentskillexchange.com':226 'agentskillexchange.com/skills/paddleocr-multilingual-document-ocr-toolkit/)':225 'ai':27,76 'ai-friend':26,75 'allow':151 'baidu':18,67 'base':206 'basic':161 'bridg':43,92 'c':116 'caveat':102 'common':177 'comprehens':109 'configur':179 'convert':20,69 'custom':154 'data':7,29,56,78 'deploy':118,143,191 'develop':16,65 'doc':217 'docker':155 'document':3,21,52,70,168,205 'exchang':224 'extract':214 'featur':127 'follow':204 'friend':28,77 'fulli':147 'gap':45,94 'get':165 'getting-start':164 'github.com':212 'github.com/paddlepaddle/paddleocr':211 'hardwar':182 'high':138 'high-stabl':137 'ident':130 'imag':23,72,156 'images/pdfs':47,96 'img.shields.io':107 'img.shields.io/badge/python-3.8~3.12-aff.svg)':106 'implement':135 'includ':173 'industri':40,89 'industry-lead':39,88 'infer':185 'instal':99 'json':31,80 'key':174 'languag':37,86 'latenc':186 'lead':41,90 'lightweight':13,62 'like':30,79 'linux':123 'llms':49,98 'local':117,198 'mainstream':181 'markdown':33,82 'memori':188 'metric':175 'multilingu':2,51 'need':209 'note':167 'ocr':4,14,53,63 'ocrv5':115 'open':149 'open-sourc':148 'orient':142 'paddleocr':1,9,50,58 'pariti':128 'pleas':200 'power':12,61 'pp':114 'pp-ocrv5':113 'provid':190 'python':105,134 'quick':195 'raw.githubusercontent.com':219 'raw.githubusercontent.com/paddlepaddle/paddleocr/head/readme.md':218 'refer':192,201 'requir':100,160 'sdks':158 'servic':141 'service-ori':140 'skill':223 'skill-paddleocr-multilingual-document-ocr-toolkit' 'solut':119,144 'sourc':150,210,221 'source-agentskillexchange' 'stabil':139 'start':166,196 'structur':6,25,55,74 'support':35,84,121 'toolkit':8,15,57,64 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'updat':171 'upgrad':110 'upstream':104,216 'usag':162,189,199 'use':178 'user':152,194 'window':125","prices":[{"id":"5e63b9d5-00ac-451c-9a47-6f38551954b2","listingId":"3207c657-4c7a-4b4a-be61-ad60dca60b75","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:18:13.984Z"}],"sources":[{"listingId":"3207c657-4c7a-4b4a-be61-ad60dca60b75","source":"github","sourceId":"agentskillexchange/skills/paddleocr-multilingual-document-ocr-toolkit","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/paddleocr-multilingual-document-ocr-toolkit","isPrimary":false,"firstSeenAt":"2026-05-18T13:18:13.984Z","lastSeenAt":"2026-05-18T19:11:38.837Z"}],"details":{"listingId":"3207c657-4c7a-4b4a-be61-ad60dca60b75","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"paddleocr-multilingual-document-ocr-toolkit","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"54719ddf5109f0db17e9f5dacca5cca30699ee8d","skill_md_path":"skills/paddleocr-multilingual-document-ocr-toolkit/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/paddleocr-multilingual-document-ocr-toolkit"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"PaddleOCR Multilingual Document OCR and Structured Data Toolkit","description":"PaddleOCR is a powerful, lightweight OCR toolkit developed by Baidu that converts documents and images into structured, AI-friendly data like JSON and Markdown. It supports 100+ languages with industry-leading accuracy, bridging the gap between images/PDFs and LLMs."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/paddleocr-multilingual-document-ocr-toolkit"},"updatedAt":"2026-05-18T19:11:38.837Z"}}