{"id":"c57ce3ec-975c-4e46-a3c8-bc372ddbea95","shortId":"3Qzx4E","kind":"skill","title":"fish-audio","tagline":"Generate AI audio and synthesize voices with Fish Audio via AceDataCloud API. Use when creating text-to-speech audio, synthesizing voices, or generating audio content. Supports multiple voice models and TTS capabilities.","description":"# Fish Audio — Voice & Audio Synthesis\n\nGenerate AI audio and synthesize voices through AceDataCloud's Fish Audio API.\n\n> **Setup:** See [authentication](../_shared/authentication.md) for token setup.\n\n## Quick Start\n\n```bash\ncurl -X POST https://api.acedata.cloud/fish/audios \\\n  -H \"Authorization: Bearer $ACEDATACLOUD_API_TOKEN\" \\\n  -H \"Content-Type: application/json\" \\\n  -d '{\"prompt\": \"Hello, this is a demonstration of AI voice synthesis.\"}'\n```\n\n> **Async:** See [async task polling](../_shared/async-tasks.md). Poll via `POST /fish/tasks` with `{\"task_id\": \"...\"}`.\n\n## Endpoints\n\n| Endpoint | Purpose |\n|----------|---------|\n| `POST /fish/audios` | Generate audio from text or parameters |\n| `POST /fish/voices` | Voice synthesis and cloning |\n| `POST /fish/tasks` | Poll task status |\n\n## Workflows\n\n### 1. Text-to-Speech\n\n```json\nPOST /fish/audios\n{\n  \"prompt\": \"The quick brown fox jumps over the lazy dog.\",\n  \"voice_id\": \"default\"\n}\n```\n\n### 2. Voice Cloning — Register a Voice\n\nUpload a reference audio to create a cloneable voice.\n\n```json\nPOST /fish/voices\n{\n  \"voice_url\": \"https://example.com/reference-voice.mp3\",\n  \"title\": \"My Custom Voice\",\n  \"description\": \"Clear, neutral-toned speaker for TTS\",\n  \"image_url\": \"https://example.com/avatar.jpg\"\n}\n```\n\n### 3. Text-to-Speech with Cloned Voice\n\n```json\nPOST /fish/audios\n{\n  \"prompt\": \"Welcome to our platform.\",\n  \"voice_id\": \"<voice_id from POST /fish/voices>\"\n}\n```\n\n## Parameters\n\n### `/fish/audios`\n\n| Parameter | Type | Description |\n|-----------|------|-------------|\n| `prompt` | string | Text to synthesize into speech |\n| `voice_id` | string | Voice model or cloned voice ID to use |\n| `model` | string | TTS model (e.g., `\"speech-1.5\"`, `\"speech-1.5-hd\"`) |\n| `action` | string | Operation type (e.g., `\"generate\"`) |\n| `callback_url` | string | Webhook URL for async delivery |\n\n### `/fish/voices`\n\n| Parameter | Type | Description |\n|-----------|------|-------------|\n| `voice_url` | string | Reference audio URL for voice cloning |\n| `title` | string | Display title for the cloned voice |\n| `description` | string | Description of the voice |\n| `image_url` | string | Cover image URL for the voice |\n| `callback_url` | string | Webhook URL for async delivery |\n\n## Gotchas\n\n- Pricing is based on **byte count** of the generated audio\n- Voice cloning requires a clear reference audio sample\n- Text-to-speech supports multiple languages automatically\n- Use the `/fish/voices` endpoint to register a reference audio and receive a `voice_id` for TTS","tags":["fish","audio","skills","acedatacloud","acedata-cloud","agent-skills","agentskills","ai-image","ai-music","ai-tools","ai-video","claude-code"],"capabilities":["skill","source-acedatacloud","skill-fish-audio","topic-acedata-cloud","topic-agent-skills","topic-agentskills","topic-ai-image","topic-ai-music","topic-ai-tools","topic-ai-video","topic-claude-code","topic-cursor","topic-gemini-cli","topic-github-copilot","topic-mcp"],"categories":["Skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/AceDataCloud/Skills/fish-audio","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add AceDataCloud/Skills","source_repo":"https://github.com/AceDataCloud/Skills","install_from":"skills.sh"}},"qualityScore":"0.453","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 7 github stars · SKILL.md body (2,534 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:14:01.809Z","embedding":null,"createdAt":"2026-05-18T13:21:32.068Z","updatedAt":"2026-05-18T19:14:01.809Z","lastSeenAt":"2026-05-18T19:14:01.809Z","tsv":"'-1.5':236,238 '/_shared/async-tasks.md':97 '/_shared/authentication.md':57 '/avatar.jpg':188 '/fish/audios':69,109,135,199,208 '/fish/tasks':101,123 '/fish/voices':117,166,254,327 '/reference-voice.mp3':171 '1':128 '2':149 '3':189 'acedatacloud':14,49,73 'action':240 'ai':5,43,89 'api':15,53,74 'api.acedata.cloud':68 'api.acedata.cloud/fish/audios':67 'application/json':80 'async':92,94,252,296 'audio':3,6,12,23,28,38,40,44,52,111,158,262,308,315,333 'authent':56 'author':71 'automat':324 'base':301 'bash':63 'bearer':72 'brown':139 'byte':303 'callback':246,290 'capabl':36 'clear':177,313 'clone':121,151,195,225,266,273,310 'cloneabl':162 'content':29,78 'content-typ':77 'count':304 'cover':284 'creat':18,160 'curl':64 'custom':174 'd':81 'default':148 'deliveri':253,297 'demonstr':87 'descript':176,211,257,275,277 'display':269 'dog':145 'e.g':234,244 'endpoint':105,106,328 'example.com':170,187 'example.com/avatar.jpg':186 'example.com/reference-voice.mp3':169 'fish':2,11,37,51 'fish-audio':1 'fox':140 'generat':4,27,42,110,245,307 'gotcha':298 'h':70,76 'hd':239 'hello':83 'id':104,147,206,220,227,338 'imag':184,281,285 'json':133,164,197 'jump':141 'languag':323 'lazi':144 'model':33,223,230,233 'multipl':31,322 'neutral':179 'neutral-ton':178 'oper':242 'paramet':115,207,209,255 'platform':204 'poll':96,98,124 'post':66,100,108,116,122,134,165,198 'price':299 'prompt':82,136,200,212 'purpos':107 'quick':61,138 'receiv':335 'refer':157,261,314,332 'regist':152,330 'requir':311 'sampl':316 'see':55,93 'setup':54,60 'skill' 'skill-fish-audio' 'source-acedatacloud' 'speaker':181 'speech':22,132,193,218,235,237,320 'start':62 'status':126 'string':213,221,231,241,248,260,268,276,283,292 'support':30,321 'synthes':8,24,46,216 'synthesi':41,91,119 'task':95,103,125 'text':20,113,130,191,214,318 'text-to-speech':19,129,190,317 'titl':172,267,270 'token':59,75 'tone':180 'topic-acedata-cloud' 'topic-agent-skills' 'topic-agentskills' 'topic-ai-image' 'topic-ai-music' 'topic-ai-tools' 'topic-ai-video' 'topic-claude-code' 'topic-cursor' 'topic-gemini-cli' 'topic-github-copilot' 'topic-mcp' 'tts':35,183,232,340 'type':79,210,243,256 'upload':155 'url':168,185,247,250,259,263,282,286,291,294 'use':16,229,325 'via':13,99 'voic':9,25,32,39,47,90,118,146,150,154,163,167,175,196,205,219,222,226,258,265,274,280,289,309,337 'webhook':249,293 'welcom':201 'workflow':127 'x':65","prices":[{"id":"a50b7a7e-6680-4eb2-a320-4d894e2a152e","listingId":"c57ce3ec-975c-4e46-a3c8-bc372ddbea95","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"AceDataCloud","category":"Skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:21:32.068Z"}],"sources":[{"listingId":"c57ce3ec-975c-4e46-a3c8-bc372ddbea95","source":"github","sourceId":"AceDataCloud/Skills/fish-audio","sourceUrl":"https://github.com/AceDataCloud/Skills/tree/main/skills/fish-audio","isPrimary":false,"firstSeenAt":"2026-05-18T13:21:32.068Z","lastSeenAt":"2026-05-18T19:14:01.809Z"}],"details":{"listingId":"c57ce3ec-975c-4e46-a3c8-bc372ddbea95","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"AceDataCloud","slug":"fish-audio","github":{"repo":"AceDataCloud/Skills","stars":7,"topics":["acedata-cloud","agent-skills","agentskills","ai-image","ai-music","ai-tools","ai-video","claude-code","cursor","gemini-cli","github-copilot","mcp","npm","openai-codex","roo-code"],"license":"other","html_url":"https://github.com/AceDataCloud/Skills","pushed_at":"2026-05-18T07:35:03Z","description":"Agent Skills for AceDataCloud AI services — music, image, video generation, web search, and more. Compatible with Claude Code, GitHub Copilot, Gemini CLI, and all agentskills.io-compatible agents.","skill_md_sha":"ef22d1eaeec319548fd331f87a3071efa7415127","skill_md_path":"skills/fish-audio/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/AceDataCloud/Skills/tree/main/skills/fish-audio"},"layout":"multi","source":"github","category":"Skills","frontmatter":{"name":"fish-audio","license":"Apache-2.0","description":"Generate AI audio and synthesize voices with Fish Audio via AceDataCloud API. Use when creating text-to-speech audio, synthesizing voices, or generating audio content. Supports multiple voice models and TTS capabilities.","compatibility":"Requires ACEDATACLOUD_API_TOKEN in .env file (see _shared/authentication.md)."},"skills_sh_url":"https://skills.sh/AceDataCloud/Skills/fish-audio"},"updatedAt":"2026-05-18T19:14:01.809Z"}}