{"id":"4700be41-8c00-498f-8b17-31ae33a80532","shortId":"3SnfX9","kind":"skill","title":"Newspaper4k Python Article Extraction and NLP Library","tagline":"Newspaper4k is an actively maintained fork of the popular Newspaper3k library for Python. It extracts articles, titles, images, authors, and metadata from news websites, with built-in NLP for keyword extraction and text summarization.","description":"# Newspaper4k Python Article Extraction and NLP Library\n\nNewspaper4k is an actively maintained fork of the popular Newspaper3k library for Python. It extracts articles, titles, images, authors, and metadata from news websites, with built-in NLP for keyword extraction and text summarization.\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- pip install newspaper4k\n- pip install newspaper4k[gnews]\n- brew install libxml2 libxslt\n- brew install libtiff libjpeg webp little-cms2\n\nRequirements and caveats from upstream:\n- ## Python compatibility\n- Python 3.10+ minimum\n- python -m newspaper --url=\"https://edition.cnn.com/2023/11/17/success/job-seekers-use-ai/index.html\" --language=en --output-format=json --output-file=article.json\n\nBasic usage or getting-started notes:\n- bash\n- ## Using the CLI\n- You can start directly from the command line, using the included CLI:\n\n- Source: https://github.com/AndyTheFactory/newspaper4k\n- Extracted from upstream docs: https://raw.githubusercontent.com/AndyTheFactory/newspaper4k/HEAD/README.md\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/newspaper4k-python-article-extraction-nlp/)","tags":["newspaper4k","python","article","extraction","nlp","skills","agentskillexchange","agent-skills","ai-agents","ai-tools","awesome-list","claude-code"],"capabilities":["skill","source-agentskillexchange","skill-newspaper4k-python-article-extraction-nlp","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/newspaper4k-python-article-extraction-nlp","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,193 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:11:24.368Z","embedding":null,"createdAt":"2026-05-18T13:17:53.340Z","updatedAt":"2026-05-18T19:11:24.368Z","lastSeenAt":"2026-05-18T19:11:24.368Z","tsv":"'/2023/11/17/success/job-seekers-use-ai/index.html':132 '/andythefactory/newspaper4k':169 '/andythefactory/newspaper4k/head/readme.md':176 '/skills/newspaper4k-python-article-extraction-nlp/)':183 '3.10':124 'activ':11,53 'agent':178 'agentskillexchange.com':182 'agentskillexchange.com/skills/newspaper4k-python-article-extraction-nlp/)':181 'articl':3,23,45,65 'article.json':142 'author':26,68 'bash':150 'basic':143 'brew':104,108 'built':34,76 'built-in':33,75 'caveat':118 'cli':153,165 'cms2':115 'command':160 'compat':122 'direct':157 'doc':173 'edition.cnn.com':131 'edition.cnn.com/2023/11/17/success/job-seekers-use-ai/index.html':130 'en':134 'environ':96 'exchang':180 'extract':4,22,39,46,64,81,170 'file':141 'fork':13,55 'format':137 'get':147 'getting-start':146 'github.com':168 'github.com/andythefactory/newspaper4k':167 'gnew':103 'imag':25,67 'includ':164 'instal':85,89,98,101,105,109 'json':138 'keyword':38,80 'languag':133 'libjpeg':111 'librari':7,18,49,60 'libtiff':110 'libxml2':106 'libxslt':107 'line':161 'littl':114 'little-cms2':113 'm':127 'maintain':12,54 'match':94 'metadata':28,70 'minimum':125 'news':30,72 'newspap':128 'newspaper3k':17,59 'newspaper4k':1,8,43,50,99,102 'nlp':6,36,48,78 'note':149 'output':136,140 'output-fil':139 'output-format':135 'path':92 'pip':97,100 'popular':16,58 'python':2,20,44,62,121,123,126 'raw.githubusercontent.com':175 'raw.githubusercontent.com/andythefactory/newspaper4k/head/readme.md':174 'requir':116 'setup':91 'skill':179 'skill-newspaper4k-python-article-extraction-nlp' 'sourc':166,177 'source-agentskillexchange' 'start':148,156 'summar':42,84 'text':41,83 'titl':24,66 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'upstream':88,120,172 'url':129 'usag':144 'use':86,151,162 'webp':112 'websit':31,73","prices":[{"id":"257a93ae-7ae0-4a69-8c73-7540c682121e","listingId":"4700be41-8c00-498f-8b17-31ae33a80532","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:17:53.340Z"}],"sources":[{"listingId":"4700be41-8c00-498f-8b17-31ae33a80532","source":"github","sourceId":"agentskillexchange/skills/newspaper4k-python-article-extraction-nlp","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/newspaper4k-python-article-extraction-nlp","isPrimary":false,"firstSeenAt":"2026-05-18T13:17:53.340Z","lastSeenAt":"2026-05-18T19:11:24.368Z"}],"details":{"listingId":"4700be41-8c00-498f-8b17-31ae33a80532","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"newspaper4k-python-article-extraction-nlp","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"4af2588f4f0278bb6846de7abd8408fd15689724","skill_md_path":"skills/newspaper4k-python-article-extraction-nlp/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/newspaper4k-python-article-extraction-nlp"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"Newspaper4k Python Article Extraction and NLP Library","description":"Newspaper4k is an actively maintained fork of the popular Newspaper3k library for Python. It extracts articles, titles, images, authors, and metadata from news websites, with built-in NLP for keyword extraction and text summarization."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/newspaper4k-python-article-extraction-nlp"},"updatedAt":"2026-05-18T19:11:24.368Z"}}