{"id":"03a17887-1306-4e0e-bbd0-b18a8193cbf9","shortId":"RWDvgu","kind":"skill","title":"Whishper Self-Hosted Speech-to-Text and Audio Workflow Skill","tagline":"Whishper is an open source self-hosted web app for speech-to-text, translation, and subtitle workflows built around Whisper models. This skill covers running Whishper with Docker, handling uploads and transcripts, and wiring the output into broader automation flows.","description":"# Whishper Self-Hosted Speech-to-Text and Audio Workflow Skill\n\nWhishper is an open source self-hosted web app for speech-to-text, translation, and subtitle workflows built around Whisper models. This skill covers running Whishper with Docker, handling uploads and transcripts, and wiring the output into broader automation flows.\n\n## Prerequisites\n\nDocker\n\n## Installation\n\nRequirements and caveats from upstream:\n- [![](https://img.shields.io/docker/pulls/pluja/whishper?style=for-the-badge&logo=docker&logoColor=white)](https://hub.docker.com/r/pluja/whishper)\n\nBasic usage or getting-started notes:\n- [x] 👍 **Quick and easy setup**: use the quick start script, or run through a few steps!\n- [x] 🐎 **CPU support**: no GPU? No problem! Whishper can run on CPU too.\n- These screenshots are available on [the official website](https://whishper-docs.pages.dev/usage/transcriptions/), click any of the following links to see:\n\n- Source: https://github.com/pluja/whishper\n- Extracted from upstream docs: https://raw.githubusercontent.com/pluja/whishper/HEAD/README.md\n\n## Documentation\n\n- https://whishper-docs.pages.dev/guides/install/\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill/)","tags":["whishper","self","hosted","speech","text","audio","workflow","skill","skills","agentskillexchange","agent-skills","ai-agents"],"capabilities":["skill","source-agentskillexchange","skill-whishper-self-hosted-speech-to-text-audio-workflow-skill","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,269 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:13:05.215Z","embedding":null,"createdAt":"2026-05-18T13:20:17.551Z","updatedAt":"2026-05-18T19:13:05.215Z","lastSeenAt":"2026-05-18T19:13:05.215Z","tsv":"'/docker/pulls/pluja/whishper?style=for-the-badge&logo=docker&logocolor=white)](https://hub.docker.com/r/pluja/whishper)':119 '/guides/install/':189 '/pluja/whishper':178 '/pluja/whishper/head/readme.md':185 '/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill/)':196 '/usage/transcriptions/),':166 'agent':191 'agentskillexchange.com':195 'agentskillexchange.com/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill/)':194 'app':22,76 'around':33,87 'audio':10,64 'autom':53,107 'avail':159 'basic':120 'broader':52,106 'built':32,86 'caveat':114 'click':167 'cover':38,92 'cpu':144,154 'doc':182 'docker':42,96,110 'document':186 'easi':130 'exchang':193 'extract':179 'flow':54,108 'follow':171 'get':124 'getting-start':123 'github.com':177 'github.com/pluja/whishper':176 'gpu':147 'handl':43,97 'host':4,20,58,74 'img.shields.io':118 'img.shields.io/docker/pulls/pluja/whishper?style=for-the-badge&logo=docker&logocolor=white)](https://hub.docker.com/r/pluja/whishper)':117 'instal':111 'link':172 'model':35,89 'note':126 'offici':162 'open':16,70 'output':50,104 'prerequisit':109 'problem':149 'quick':128,134 'raw.githubusercontent.com':184 'raw.githubusercontent.com/pluja/whishper/head/readme.md':183 'requir':112 'run':39,93,138,152 'screenshot':157 'script':136 'see':174 'self':3,19,57,73 'self-host':2,18,56,72 'setup':131 'skill':12,37,66,91,192 'skill-whishper-self-hosted-speech-to-text-audio-workflow-skill' 'sourc':17,71,175,190 'source-agentskillexchange' 'speech':6,25,60,79 'speech-to-text':5,24,59,78 'start':125,135 'step':142 'subtitl':30,84 'support':145 'text':8,27,62,81 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'transcript':46,100 'translat':28,82 'upload':44,98 'upstream':116,181 'usag':121 'use':132 'web':21,75 'websit':163 'whishper':1,13,40,55,67,94,150 'whishper-docs.pages.dev':165,188 'whishper-docs.pages.dev/guides/install/':187 'whishper-docs.pages.dev/usage/transcriptions/),':164 'whisper':34,88 'wire':48,102 'workflow':11,31,65,85 'x':127,143","prices":[{"id":"80a42874-9ac5-4858-9710-28333c00440c","listingId":"03a17887-1306-4e0e-bbd0-b18a8193cbf9","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:20:17.551Z"}],"sources":[{"listingId":"03a17887-1306-4e0e-bbd0-b18a8193cbf9","source":"github","sourceId":"agentskillexchange/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill","isPrimary":false,"firstSeenAt":"2026-05-18T13:20:17.551Z","lastSeenAt":"2026-05-18T19:13:05.215Z"}],"details":{"listingId":"03a17887-1306-4e0e-bbd0-b18a8193cbf9","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"whishper-self-hosted-speech-to-text-audio-workflow-skill","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"63489aa1977e0515fc5506b540f8c3f1df372139","skill_md_path":"skills/whishper-self-hosted-speech-to-text-audio-workflow-skill/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"Whishper Self-Hosted Speech-to-Text and Audio Workflow Skill","description":"Whishper is an open source self-hosted web app for speech-to-text, translation, and subtitle workflows built around Whisper models. This skill covers running Whishper with Docker, handling uploads and transcripts, and wiring the output into broader automation flows."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/whishper-self-hosted-speech-to-text-audio-workflow-skill"},"updatedAt":"2026-05-18T19:13:05.215Z"}}