Content Core
Extracts content from diverse media sources including URLs, documents, videos, audio files, and images using intellig...
What it does
Extracts content from diverse media sources including URLs, documents, videos, audio files, and images using intelligent auto-detection and multiple extraction engines for unified content processing and analysis.
Content Core MCP Server provides intelligent content extraction from diverse media sources including URLs, documents (PDF, Office files, EPUB), videos, audio files, and images through a unified interface. Built by Luis Novo, it leverages multiple extraction engines with smart auto-detection - using Docling for documents when available, falling back to PyMuPDF, and supporting Firecrawl, Jina, or BeautifulSoup for web content based on API availability. The server handles complex workflows like YouTube transcript extraction, audio/video transcription via OpenAI Whisper, and OCR for images, making it valuable for research automation, content analysis, and building AI agents that need to process mixed media content without manual preprocessing.
Capabilities
Server
Quality
deterministic score 0.80 from registry signals: · indexed on pulsemcp · has source repo · 147 github stars · registry-generated description present