{"id":"4197e46e-7db0-417a-bb8f-fb2132e3994a","shortId":"SrJw2d","kind":"skill","title":"Extract clean article Markdown from web pages with Defuddle","tagline":"Use Defuddle when an agent needs clean, metadata-rich article text or Markdown from noisy web pages before summarizing, indexing, or archiving them.","description":"# Extract clean article Markdown from web pages with Defuddle\n\nUse Defuddle when an agent needs clean, metadata-rich article text or Markdown from noisy web pages before summarizing, indexing, or archiving them.\n\n## Prerequisites\n\nNode.js, npx or npm, defuddle CLI\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- npx defuddle parse page.html\n- npx defuddle parse https://example.com/article\n- npx defuddle parse page.html --markdown\n- npx defuddle parse page.html --json\n\nRequirements and caveats from upstream:\n- ### Node.js\n- defuddle/node accepts a DOM Document from any implementation (JSDOM, linkedom, happy-dom, etc.).\n- import { Defuddle } from 'defuddle/node';\n\nBasic usage or getting-started notes:\n- Defuddle takes a URL or HTML, finds the main content, and returns cleaned HTML or Markdown. Defuddle was created for the browser extension [Obsidian Web Clipper](https://github.com/obsidianmd/obsidian-clipper), but it...\n- ### Browser\n- javascript\n\n- Source: https://github.com/kepano/defuddle\n- Extracted from upstream docs: https://raw.githubusercontent.com/kepano/defuddle/HEAD/README.md\n\n## Documentation\n\n- https://defuddle.md\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/extract-clean-article-markdown-from-web-pages-with-defuddle/)","tags":["extract","clean","article","markdown","from","web","pages","with","defuddle","skills","agentskillexchange","agent-skills"],"capabilities":["skill","source-agentskillexchange","skill-extract-clean-article-markdown-from-web-pages-with-defuddle","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/extract-clean-article-markdown-from-web-pages-with-defuddle","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,320 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:10:24.138Z","embedding":null,"createdAt":"2026-05-18T13:16:28.432Z","updatedAt":"2026-05-18T19:10:24.138Z","lastSeenAt":"2026-05-18T19:10:24.138Z","tsv":"'/article':95 '/kepano/defuddle':173 '/kepano/defuddle/head/readme.md':180 '/obsidianmd/obsidian-clipper),':165 '/skills/extract-clean-article-markdown-from-web-pages-with-defuddle/)':189 'accept':113 'agent':14,47,184 'agentskillexchange.com':188 'agentskillexchange.com/skills/extract-clean-article-markdown-from-web-pages-with-defuddle/)':187 'archiv':32,65 'articl':3,20,36,53 'basic':130 'browser':158,168 'caveat':108 'clean':2,16,35,49,149 'cli':73 'clipper':162 'content':146 'creat':155 'defuddl':9,11,42,44,72,87,91,97,102,127,137,153 'defuddle.md':182 'defuddle/node':112,129 'doc':177 'document':116,181 'dom':115,124 'environ':85 'etc':125 'example.com':94 'example.com/article':93 'exchang':186 'extens':159 'extract':1,34,174 'find':143 'get':134 'getting-start':133 'github.com':164,172 'github.com/kepano/defuddle':171 'github.com/obsidianmd/obsidian-clipper),':163 'happi':123 'happy-dom':122 'html':142,150 'implement':119 'import':126 'index':30,63 'instal':74,78 'javascript':169 'jsdom':120 'json':105 'linkedom':121 'main':145 'markdown':4,23,37,56,100,152 'match':83 'metadata':18,51 'metadata-rich':17,50 'need':15,48 'node.js':68,111 'noisi':25,58 'note':136 'npm':71 'npx':69,86,90,96,101 'obsidian':160 'page':7,27,40,60 'page.html':89,99,104 'pars':88,92,98,103 'path':81 'prerequisit':67 'raw.githubusercontent.com':179 'raw.githubusercontent.com/kepano/defuddle/head/readme.md':178 'requir':106 'return':148 'rich':19,52 'setup':80 'skill':185 'skill-extract-clean-article-markdown-from-web-pages-with-defuddle' 'sourc':170,183 'source-agentskillexchange' 'start':135 'summar':29,62 'take':138 'text':21,54 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'upstream':77,110,176 'url':140 'usag':131 'use':10,43,75 'web':6,26,39,59,161","prices":[{"id":"5449497a-3029-4533-87b2-5f8156d3ace8","listingId":"4197e46e-7db0-417a-bb8f-fb2132e3994a","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:16:28.432Z"}],"sources":[{"listingId":"4197e46e-7db0-417a-bb8f-fb2132e3994a","source":"github","sourceId":"agentskillexchange/skills/extract-clean-article-markdown-from-web-pages-with-defuddle","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/extract-clean-article-markdown-from-web-pages-with-defuddle","isPrimary":false,"firstSeenAt":"2026-05-18T13:16:28.432Z","lastSeenAt":"2026-05-18T19:10:24.138Z"}],"details":{"listingId":"4197e46e-7db0-417a-bb8f-fb2132e3994a","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"extract-clean-article-markdown-from-web-pages-with-defuddle","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"5d0259c57c6e2e3ec5dee321c58f4fc19a6f2085","skill_md_path":"skills/extract-clean-article-markdown-from-web-pages-with-defuddle/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/extract-clean-article-markdown-from-web-pages-with-defuddle"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"Extract clean article Markdown from web pages with Defuddle","description":"Use Defuddle when an agent needs clean, metadata-rich article text or Markdown from noisy web pages before summarizing, indexing, or archiving them."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/extract-clean-article-markdown-from-web-pages-with-defuddle"},"updatedAt":"2026-05-18T19:10:24.138Z"}}