Whishper Self-Hosted Speech-to-Text and Audio Workflow Skill
Whishper is an open-source, self-hosted web app for speech-to-text, translation, and subtitle workflows built around Whisper models. This skill covers running Whishper with Docker, handling uploads and transcripts, and wiring the output into broader automation flows.
What it does
Whishper is an open-source, self-hosted web app for speech-to-text, translation, and subtitle workflows built around Whisper models. This skill covers running Whishper with Docker, handling uploads and transcripts, and wiring the output into broader automation flows.
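As a rough illustration of wiring transcript output into a subtitle workflow, the sketch below turns timed transcript segments into SRT text. The `Segment` shape (start and end in seconds, plus text) is an assumption modeled on typical Whisper-style output, not Whishper's documented export format, and the helper names are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Segment:
    """Hypothetical transcript segment; field names are assumptions."""
    start: float  # start time in seconds
    end: float    # end time in seconds
    text: str


def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments: Iterable[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    blocks = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{format_timestamp(seg.start)} --> {format_timestamp(seg.end)}"
        blocks.append(f"{index}\n{time_range}\n{seg.text.strip()}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    demo = [Segment(0.0, 1.25, " Hello "), Segment(1.25, 3.0, "world")]
    print(segments_to_srt(demo))
```

If the actual transcript export uses different field names or time units, only the `Segment` construction step would need to change.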
Prerequisites
Docker
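Before starting, an automation flow may want to confirm that the Docker prerequisite is met. A minimal sketch, assuming the Docker CLI is installed under the name `docker` and found via `PATH` (the function name is hypothetical):

```python
import shutil


def docker_available(executable: str = "docker") -> bool:
    """Return True if the given executable is found on PATH."""
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if docker_available():
        print("Docker CLI found on PATH.")
    else:
        print("Docker CLI not found; install Docker before continuing.")
```

This only checks that the binary is present; it does not verify that the Docker daemon is running.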
Installation
Requirements and caveats from upstream:
Basic usage or getting-started notes:
- Quick and easy setup: use the quick start script, or run through a few steps!
- CPU support: no GPU? No problem! Whishper can run on CPU too.
- Extracted from upstream docs: https://raw.githubusercontent.com/pluja/whishper/HEAD/README.md
Quality
deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,269 chars)