{"id":"223bfeb6-0abc-4951-9329-fddc66fcd33c","shortId":"TTwqu9","kind":"skill","title":"OpenAI Whisper Batch Transcription Pipeline","tagline":"Processes audio files from an S3 bucket using Whisper large-v3, splitting recordings into 30-second chunks with ffmpeg before transcription. Outputs timestamped SRT and VTT subtitle files plus plain-text transcripts, then uploads artifacts back to S3. Supports language auto-detec","description":"# OpenAI Whisper Batch Transcription Pipeline\n\nProcesses audio files from an S3 bucket using Whisper large-v3, splitting recordings into 30-second chunks with ffmpeg before transcription. Outputs timestamped SRT and VTT subtitle files plus plain-text transcripts, then uploads artifacts back to S3. Supports language auto-detection and translation to English.\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- pip install -U openai-whisper\n- pip install git+https://github.com/openai/whisper.git\n- pip install --upgrade --no-deps --force-reinstall git+https://github.com/openai/whisper.git\n- brew install ffmpeg\n\nRequirements and caveats from upstream:\n- We used Python 3.9.9 and [PyTorch](https://pytorch.org/) 1.10.1 to train and test our models, but the codebase is expected to be compatible with Python 3.8-3.11 and recent PyTorch versions. The codebase also depends o...\n- Alternatively, the following command will pull and install the latest commit from this repository, along with its Python dependencies:\n- It also requires the command-line tool [ffmpeg](https://ffmpeg.org/) to be installed on your system, which is available from most package managers:\n\nBasic usage or getting-started notes:\n- [[Colab example]](https://colab.research.google.com/github/openai/whisper/blob/master/notebooks/LibriSpeech.ipynb)\n- To update the package to the latest version of this repository, please run:\n- bash\n\n- Source: https://github.com/openai/whisper\n- Extracted from upstream docs: https://raw.githubusercontent.com/openai/whisper/HEAD/README.md\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/whisper-batch-transcription-pipeline/)","tags":["whisper","batch","transcription","pipeline","skills","agentskillexchange","agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex"],"capabilities":["skill","source-agentskillexchange","skill-whisper-batch-transcription-pipeline","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/whisper-batch-transcription-pipeline","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,698 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:13:05.318Z","embedding":null,"createdAt":"2026-05-18T13:20:17.737Z","updatedAt":"2026-05-18T19:13:05.318Z","lastSeenAt":"2026-05-18T19:13:05.318Z","tsv":"'-3.11':177 '/)':158,217 '/github/openai/whisper/blob/master/notebooks/librispeech.ipynb)':242 '/openai/whisper':260 '/openai/whisper.git':128,141 '/openai/whisper/head/readme.md':267 '/skills/whisper-batch-transcription-pipeline/)':274 '1.10.1':159 '3.8':176 '3.9.9':153 '30':21,71 'agent':269 'agentskillexchange.com':273 'agentskillexchange.com/skills/whisper-batch-transcription-pipeline/)':272 'along':201 'also':184,207 'altern':187 'artifact':42,92 'audio':7,57 'auto':49,99 'auto-detec':48 'auto-detect':98 'avail':226 'back':43,93 'bash':256 'basic':231 'batch':3,53 'brew':142 'bucket':12,62 'caveat':147 'chunk':23,73 'codebas':168,183 'colab':238 'colab.research.google.com':241 'colab.research.google.com/github/openai/whisper/blob/master/notebooks/librispeech.ipynb)':240 'command':190,211 'command-lin':210 'commit':197 'compat':173 'dep':134 'depend':185,205 'detec':50 'detect':100 'doc':264 'english':104 'environ':116 'exampl':239 'exchang':271 'expect':170 'extract':261 'ffmpeg':25,75,144,214 'ffmpeg.org':216 'ffmpeg.org/)':215 'file':8,34,58,84 'follow':189 'forc':136 'force-reinstal':135 'get':235 'getting-start':234 'git':125,138 'github.com':127,140,259 'github.com/openai/whisper':258 'github.com/openai/whisper.git':126,139 'instal':105,109,118,124,130,143,194,220 'languag':47,97 'larg':16,66 'large-v3':15,65 'latest':196,249 'line':212 'manag':230 'match':114 'model':165 'no-dep':132 'note':237 'o':186 'openai':1,51,121 'openai-whisp':120 'output':28,78 'packag':229,246 'path':112 'pip':117,123,129 'pipelin':5,55 'plain':37,87 'plain-text':36,86 'pleas':254 'plus':35,85 'process':6,56 'pull':192 'python':152,175,204 'pytorch':155,180 'pytorch.org':157 'pytorch.org/)':156 'raw.githubusercontent.com':266 'raw.githubusercontent.com/openai/whisper/head/readme.md':265 'recent':179 'record':19,69 'reinstal':137 'repositori':200,253 'requir':145,208 'run':255 's3':11,45,61,95 'second':22,72 'setup':111 'skill':270 'skill-whisper-batch-transcription-pipeline' 'sourc':257,268 'source-agentskillexchange' 'split':18,68 'srt':30,80 'start':236 'subtitl':33,83 'support':46,96 'system':223 'test':163 'text':38,88 'timestamp':29,79 'tool':213 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'train':161 'transcript':4,27,39,54,77,89 'translat':102 'u':119 'updat':244 'upgrad':131 'upload':41,91 'upstream':108,149,263 'usag':232 'use':13,63,106,151 'v3':17,67 'version':181,250 'vtt':32,82 'whisper':2,14,52,64,122","prices":[{"id":"45b01b8a-2d05-4d8a-9de6-5843c7f7b8b0","listingId":"223bfeb6-0abc-4951-9329-fddc66fcd33c","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:20:17.737Z"}],"sources":[{"listingId":"223bfeb6-0abc-4951-9329-fddc66fcd33c","source":"github","sourceId":"agentskillexchange/skills/whisper-batch-transcription-pipeline","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/whisper-batch-transcription-pipeline","isPrimary":false,"firstSeenAt":"2026-05-18T13:20:17.737Z","lastSeenAt":"2026-05-18T19:13:05.318Z"}],"details":{"listingId":"223bfeb6-0abc-4951-9329-fddc66fcd33c","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"whisper-batch-transcription-pipeline","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"8021e5e3690d4db4da8e944f22ed8fb767a2bbe0","skill_md_path":"skills/whisper-batch-transcription-pipeline/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/whisper-batch-transcription-pipeline"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"OpenAI Whisper Batch Transcription Pipeline","description":"Processes audio files from an S3 bucket using Whisper large-v3, splitting recordings into 30-second chunks with ffmpeg before transcription. Outputs timestamped SRT and VTT subtitle files plus plain-text transcripts, then uploads artifacts back to S3. Supports language auto-detection and translation to English."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/whisper-batch-transcription-pipeline"},"updatedAt":"2026-05-18T19:13:05.318Z"}}