{"id":"e36af1f2-9854-422f-a7b4-96bbf4c6ba70","shortId":"YCPdxM","kind":"skill","title":"Generate LLM fine-tuning, RAG, and eval datasets from source material with easy-dataset","tagline":"Turn raw documents into structured fine-tuning, RAG, and evaluation datasets when the real job is dataset preparation, not generic document parsing.","description":"# Generate LLM fine-tuning, RAG, and eval datasets from source material with easy-dataset\n\nTurn raw documents into structured fine-tuning, RAG, and evaluation datasets when the real job is dataset preparation, not generic document parsing.\n\n## Prerequisites\n\neasy-dataset application, supported source documents such as PDF/Markdown/DOCX/TXT/EPUB, and an operator or agent preparing datasets\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- git clone https://github.com/ConardLi/easy-dataset.git\n- npm install\n- npm run build\n- npm run start\n\nRequirements and caveats from upstream:\n- ### Using the Official Docker Image\n- Modify the docker-compose.yml file:\n\nBasic usage or getting-started notes:\n- [Features](#features) • [Quick Start](#local-run) • [Documentation](https://docs.easy-dataset.com/ed/en) • [Contributing](#contributing) • [License](#license)\n- 🎉🎉 Easy Dataset Version 1.7.0 launches brand-new evaluation capabilities! You can effortlessly convert domain-specific documents into evaluation datasets (test sets) and automatically run multi-dimensional evaluation...\n- ## Local Run\n\n- Source: https://github.com/ConardLi/easy-dataset\n- Extracted from upstream docs: https://raw.githubusercontent.com/ConardLi/easy-dataset/HEAD/README.md\n\n## Documentation\n\n- https://github.com/ConardLi/easy-dataset#readme\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset/)","tags":["generate","llm","fine","tuning","rag","and","eval","datasets","from","source","material","with"],"capabilities":["skill","source-agentskillexchange","skill-generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,525 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:10:34.231Z","embedding":null,"createdAt":"2026-05-18T13:16:41.300Z","updatedAt":"2026-05-18T19:10:34.231Z","lastSeenAt":"2026-05-18T19:10:34.231Z","tsv":"'/conardli/easy-dataset':193 '/conardli/easy-dataset#readme':204 '/conardli/easy-dataset.git':113 '/conardli/easy-dataset/head/readme.md':200 '/ed/en)':153 '/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset/)':211 '1.7.0':161 'agent':94,206 'agentskillexchange.com':210 'agentskillexchange.com/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset/)':209 'applic':83 'automat':182 'basic':136 'brand':164 'brand-new':163 'build':118 'capabl':167 'caveat':124 'clone':110 'contribut':154,155 'convert':171 'dataset':9,16,28,34,48,55,67,73,82,96,159,178 'dimension':186 'doc':197 'docker':130 'docker-compose.yml':134 'docs.easy-dataset.com':152 'docs.easy-dataset.com/ed/en)':151 'document':19,38,58,77,86,150,175,201 'domain':173 'domain-specif':172 'easi':15,54,81,158 'easy-dataset':14,53,80 'effortless':170 'environ':108 'eval':8,47 'evalu':27,66,166,177,187 'exchang':208 'extract':194 'featur':143,144 'file':135 'fine':4,23,43,62 'fine-tun':3,22,42,61 'generat':1,40 'generic':37,76 'get':140 'getting-start':139 'git':109 'github.com':112,192,203 'github.com/conardli/easy-dataset':191 'github.com/conardli/easy-dataset#readme':202 'github.com/conardli/easy-dataset.git':111 'imag':131 'instal':97,101,115 'job':32,71 'launch':162 'licens':156,157 'llm':2,41 'local':148,188 'local-run':147 'match':106 'materi':12,51 'modifi':132 'multi':185 'multi-dimension':184 'new':165 'note':142 'npm':114,116,119 'offici':129 'oper':92 'pars':39,78 'path':104 'pdf/markdown/docx/txt/epub':89 'prepar':35,74,95 'prerequisit':79 'quick':145 'rag':6,25,45,64 'raw':18,57 'raw.githubusercontent.com':199 'raw.githubusercontent.com/conardli/easy-dataset/head/readme.md':198 'real':31,70 'requir':122 'run':117,120,149,183,189 'set':180 'setup':103 'skill':207 'skill-generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset' 'sourc':11,50,85,190,205 'source-agentskillexchange' 'specif':174 'start':121,141,146 'structur':21,60 'support':84 'test':179 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'tune':5,24,44,63 'turn':17,56 'upstream':100,126,196 'usag':137 'use':98,127 'version':160","prices":[{"id":"b473386a-be29-4f1d-a227-d446757513f2","listingId":"e36af1f2-9854-422f-a7b4-96bbf4c6ba70","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:16:41.300Z"}],"sources":[{"listingId":"e36af1f2-9854-422f-a7b4-96bbf4c6ba70","source":"github","sourceId":"agentskillexchange/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset","isPrimary":false,"firstSeenAt":"2026-05-18T13:16:41.300Z","lastSeenAt":"2026-05-18T19:10:34.231Z"}],"details":{"listingId":"e36af1f2-9854-422f-a7b4-96bbf4c6ba70","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"748708394403a2f49831877f666c5037f9d12b2c","skill_md_path":"skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"Generate LLM fine-tuning, RAG, and eval datasets from source material with easy-dataset","description":"Turn raw documents into structured fine-tuning, RAG, and evaluation datasets when the real job is dataset preparation, not generic document parsing."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/generate-llm-fine-tuning-rag-and-eval-datasets-from-source-material-with-easy-dataset"},"updatedAt":"2026-05-18T19:10:34.231Z"}}