{"id":"b255c97e-712b-4d7d-9222-2335df18d3fc","shortId":"2cH3DN","kind":"skill","title":"Kokoro FastAPI OpenAI-Compatible Text-to-Speech Server","tagline":"Kokoro-FastAPI is a Dockerized FastAPI wrapper around the Kokoro-82M text-to-speech model with OpenAI-compatible speech endpoints. It supports local TTS serving, multi-language synthesis, web UI access, and timestamped audio generation workflows.","description":"# Kokoro FastAPI OpenAI-Compatible Text-to-Speech Server\n\nKokoro-FastAPI is a Dockerized FastAPI wrapper around the Kokoro-82M text-to-speech model with OpenAI-compatible speech endpoints. It supports local TTS serving, multi-language synthesis, web UI access, and timestamped audio generation workflows.\n\n## Prerequisites\n\nDocker\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- docker run -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-cpu:latest # CPU, or:\n- docker run --gpus all -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-gpu:latest # NVIDIA GPU, or:\n- docker run --device=/dev/kfd --device=/dev/dri -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-rocm:latest # AMD GPU (ROCm, experimental, amd64 only)\n- git clone https://github.com/remsky/Kokoro-FastAPI.git\n\nRequirements and caveats from upstream:\n- <summary>Quickest Start (docker run)</summary>\n- <summary>Quick Start (docker compose) </summary>\n- Install prerequisites, and start the service using Docker Compose (Full setup including UI):\n\nBasic usage or getting-started notes:\n- Pre built images are available to run, with arm/multi-arch support, and baked in models\n- ### Named versions should be pinned for your regular usage.\n\n- Source: https://github.com/remsky/Kokoro-FastAPI\n- Extracted from upstream docs: https://raw.githubusercontent.com/remsky/Kokoro-FastAPI/HEAD/README.md\n\n## Documentation\n\n- https://github.com/remsky/Kokoro-FastAPI/wiki\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/kokoro-fastapi-openai-compatible-text-to-speech-server/)","tags":["kokoro","fastapi","openai","compatible","text","speech","server","skills","agentskillexchange","agent-skills","ai-agents","ai-tools"],"capabilities":["skill","source-agentskillexchange","skill-kokoro-fastapi-openai-compatible-text-to-speech-server","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/kokoro-fastapi-openai-compatible-text-to-speech-server","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,577 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:11:04.167Z","embedding":null,"createdAt":"2026-05-18T13:17:23.514Z","updatedAt":"2026-05-18T19:11:04.167Z","lastSeenAt":"2026-05-18T19:11:04.167Z","tsv":"'/dev/dri':145 '/dev/kfd':143 '/remsky/kokoro-fastapi':222 '/remsky/kokoro-fastapi-cpu:latest':124 '/remsky/kokoro-fastapi-gpu:latest':136 '/remsky/kokoro-fastapi-rocm:latest':151 '/remsky/kokoro-fastapi.git':162 '/remsky/kokoro-fastapi/head/readme.md':229 '/remsky/kokoro-fastapi/wiki':233 '/skills/kokoro-fastapi-openai-compatible-text-to-speech-server/)':240 '82m':23,74 '8880':120,121,132,133,147,148 'access':46,97 'agent':235 'agentskillexchange.com':239 'agentskillexchange.com/skills/kokoro-fastapi-openai-compatible-text-to-speech-server/)':238 'amd':152 'amd64':156 'arm/multi-arch':204 'around':19,70 'audio':49,100 'avail':200 'bake':207 'basic':189 'built':197 'caveat':165 'clone':159 'compat':5,32,56,83 'compos':175,184 'cpu':125 'devic':142,144 'doc':226 'docker':16,67,104,117,127,140,170,174,183 'document':230 'endpoint':34,85 'environ':116 'exchang':237 'experiment':155 'extract':223 'fastapi':2,13,17,53,64,68 'full':185 'generat':50,101 'get':193 'getting-start':192 'ghcr.io':123,135,150 'ghcr.io/remsky/kokoro-fastapi-cpu:latest':122 'ghcr.io/remsky/kokoro-fastapi-gpu:latest':134 'ghcr.io/remsky/kokoro-fastapi-rocm:latest':149 'git':158 'github.com':161,221,232 'github.com/remsky/kokoro-fastapi':220 'github.com/remsky/kokoro-fastapi.git':160 'github.com/remsky/kokoro-fastapi/wiki':231 'gpu':138,153 'gpus':129 'imag':198 'includ':187 'instal':105,109,176 'kokoro':1,12,22,52,63,73 'kokoro-82m':21,72 'kokoro-fastapi':11,62 'languag':42,93 'local':37,88 'match':114 'model':28,79,209 'multi':41,92 'multi-languag':40,91 'name':210 'note':195 'nvidia':137 'openai':4,31,55,82 'openai-compat':3,30,54,81 'p':119,131,146 'path':112 'pin':214 'pre':196 'prerequisit':103,177 'quick':172 'quickest':168 'raw.githubusercontent.com':228 'raw.githubusercontent.com/remsky/kokoro-fastapi/head/readme.md':227 'regular':217 'requir':163 'rocm':154 'run':118,128,141,171,202 'serv':39,90 'server':10,61 'servic':181 'setup':111,186 'skill':236 'skill-kokoro-fastapi-openai-compatible-text-to-speech-server' 'sourc':219,234 'source-agentskillexchange' 'speech':9,27,33,60,78,84 'start':169,173,179,194 'support':36,87,205 'synthesi':43,94 'text':7,25,58,76 'text-to-speech':6,24,57,75 'timestamp':48,99 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'tts':38,89 'ui':45,96,188 'upstream':108,167,225 'usag':190,218 'use':106,182 'version':211 'web':44,95 'workflow':51,102 'wrapper':18,69","prices":[{"id":"9f003cb6-281f-48a8-80c9-dcbd249b879b","listingId":"b255c97e-712b-4d7d-9222-2335df18d3fc","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:17:23.514Z"}],"sources":[{"listingId":"b255c97e-712b-4d7d-9222-2335df18d3fc","source":"github","sourceId":"agentskillexchange/skills/kokoro-fastapi-openai-compatible-text-to-speech-server","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/kokoro-fastapi-openai-compatible-text-to-speech-server","isPrimary":false,"firstSeenAt":"2026-05-18T13:17:23.514Z","lastSeenAt":"2026-05-18T19:11:04.167Z"}],"details":{"listingId":"b255c97e-712b-4d7d-9222-2335df18d3fc","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"kokoro-fastapi-openai-compatible-text-to-speech-server","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"1c9546d480d212340c279aed9a6d247713762a9b","skill_md_path":"skills/kokoro-fastapi-openai-compatible-text-to-speech-server/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/kokoro-fastapi-openai-compatible-text-to-speech-server"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"Kokoro FastAPI OpenAI-Compatible Text-to-Speech Server","description":"Kokoro-FastAPI is a Dockerized FastAPI wrapper around the Kokoro-82M text-to-speech model with OpenAI-compatible speech endpoints. It supports local TTS serving, multi-language synthesis, web UI access, and timestamped audio generation workflows."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/kokoro-fastapi-openai-compatible-text-to-speech-server"},"updatedAt":"2026-05-18T19:11:04.167Z"}}