Docs Scraper
Scrapes web-based documentation through multiple crawling strategies like sitemap traversal and menu navigation, enab...
What it does
Scrapes web-based documentation through multiple crawling strategies like sitemap traversal and menu navigation, enabling structured content extraction with rate limiting and robots.txt compliance
Documentation scraping server that enables AI assistants to extract structured content from web-based documentation through multiple crawling strategies. Built with Python and the crawl4ai library, it provides tools for single URL crawling, multi-URL batch processing, sitemap-based crawling, and menu-driven navigation extraction, with features like rate limiting, concurrent request handling, and robots.txt compliance. The implementation is particularly valuable for users who need to ingest documentation into AI systems while respecting site access policies and maintaining clean markdown output.
Capabilities
Server
Quality
deterministic score 0.55 from registry signals: · indexed on pulsemcp · has source repo · 2 github stars · registry-generated description present