Puppeteer Vision Web Scraper
Automates web scraping by intelligently handling cookie banners, CAPTCHAs, and paywalls to extract clean markdown con...
What it does
Automates web scraping by intelligently handling cookie banners, CAPTCHAs, and paywalls to extract clean markdown content from websites
Puppeteer Vision MCP Server provides a tool for scraping webpages and converting them to markdown format using Puppeteer, Mozilla's Readability, and Turndown. Developed by Denis Jannot, it features AI-driven interaction capabilities that automatically handle cookie consent banners, CAPTCHAs, paywalls, and other interactive elements blocking content. The server uses OpenAI's vision models to analyze screenshots of web pages and determine appropriate interactions, with special handling for code blocks, tables, and other structured content. Available as an NPX package with configurable headless mode, it supports both stdio and SSE transport methods, making it particularly valuable for AI assistants that need to extract and process web content without being blocked by interactive elements.
Capabilities
Server
Quality
deterministic score 0.70 from registry signals: · indexed on pulsemcp · has source repo · 48 github stars · registry-generated description present