{"id":"6f167702-b3d3-4876-acba-8a92d7990337","shortId":"dEK6Xy","kind":"skill","title":"Stable Diffusion XL Pipeline","tagline":"Orchestrates SDXL image generation via the Stability AI REST API with ControlNet conditioning, IP-Adapter style transfer, and automatic prompt enhancement using CLIP interrogation.","description":"# Stable Diffusion XL Pipeline\n\nOrchestrates SDXL image generation via the Stability AI REST API with ControlNet conditioning, IP-Adapter style transfer, and automatic prompt enhancement using CLIP interrogation.\n\n## Installation\n\nUse the upstream install or setup path that matches your environment:\n- git clone https://github.com/Stability-AI/generative-models.git\n- pip install hatch\n- pip install \"numpy>=1.17\" \"PyWavelets>=1.1.1\" \"opencv-python>=4.1.0.25\"\n- pip install --no-deps invisible-watermark\n\nRequirements and caveats from upstream:\n- python scripts/sampling/simple_video_sample_4d2.py --input_path assets/sv4d_videos/camel.gif --output_folder outputs (after downloading [sv4d2.safetensors](https://huggingface.co/stabilityai/sv4d2.0) from HuggingFace...\n- Run inference: python scripts/sampling/simple_video_sample_4d2.py --input_path <path/to/video>\n- Run inference: python scripts/sampling/simple_video_sample_4d2.py --model_path checkpoints/sv4d2_8views.safetensors --input_path assets/sv4d_videos/chest.gif --output_folder outputs\n\nBasic usage or getting-started notes:\n- To run **SV4D 2.0** on a single input video of 21 frames:\n- **Low VRAM environment** : To run on GPUs with low VRAM, try setting --encoding_t=1 (of frames encoded at a time) and --decoding_t=1 (of frames decoded at a time) or lower video resolution like --img_size=512.\n- The 5x8 model takes 5 frames of input at a time. But the inference scripts for both model take 21-frame video as input by default (same as SV3D and SV4D), we run the model autoregressively until we generate 21 frames.\n\n- Source: https://github.com/Stability-AI/generative-models\n- Extracted from upstream docs: https://raw.githubusercontent.com/Stability-AI/generative-models/HEAD/README.md\n\n## Source\n\n- [Agent Skill Exchange](https://agentskillexchange.com/skills/stable-diffusion-xl-pipeline/)","tags":["stable","diffusion","pipeline","skills","agentskillexchange","agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor"],"capabilities":["skill","source-agentskillexchange","skill-stable-diffusion-xl-pipeline","topic-agent-skills","topic-ai-agents","topic-ai-tools","topic-awesome-list","topic-claude-code","topic-codex","topic-cursor","topic-llm","topic-mcp","topic-npx-skills","topic-openclaw","topic-skills-catalog"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/agentskillexchange/skills/stable-diffusion-xl-pipeline","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add agentskillexchange/skills","source_repo":"https://github.com/agentskillexchange/skills","install_from":"skills.sh"}},"qualityScore":"0.454","qualityRationale":"deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (1,858 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:12:37.386Z","embedding":null,"createdAt":"2026-05-18T13:19:37.711Z","updatedAt":"2026-05-18T19:12:37.386Z","lastSeenAt":"2026-05-18T19:12:37.386Z","tsv":"'/skills/stable-diffusion-xl-pipeline/)':254 '/stability-ai/generative-models':240 '/stability-ai/generative-models.git':75 '/stability-ai/generative-models/head/readme.md':247 '/stabilityai/sv4d2.0)':115 '1':171,181 '1.1.1':84 '1.17':82 '2.0':148 '21':155,215,235 '4.1.0.25':88 '5':200 '512':195 '5x8':197 'adapt':20,49 'agent':249 'agentskillexchange.com':253 'agentskillexchange.com/skills/stable-diffusion-xl-pipeline/)':252 'ai':12,41 'api':14,43 'assets/sv4d_videos/camel.gif':106 'assets/sv4d_videos/chest.gif':134 'automat':24,53 'autoregress':231 'basic':138 'caveat':99 'checkpoints/sv4d2_8views.safetensors':131 'clip':28,57 'clone':72 'condit':17,46 'controlnet':16,45 'decod':179,184 'default':221 'dep':93 'diffus':2,31 'doc':244 'download':111 'encod':169,174 'enhanc':26,55 'environ':70,159 'exchang':251 'extract':241 'folder':108,136 'frame':156,173,183,201,216,236 'generat':8,37,234 'get':142 'getting-start':141 'git':71 'github.com':74,239 'github.com/stability-ai/generative-models':238 'github.com/stability-ai/generative-models.git':73 'gpus':163 'hatch':78 'huggingfac':117 'huggingface.co':114 'huggingface.co/stabilityai/sv4d2.0)':113 'imag':7,36 'img':193 'infer':119,126,209 'input':104,122,132,152,203,219 'instal':59,63,77,80,90 'interrog':29,58 'invis':95 'invisible-watermark':94 'ip':19,48 'ip-adapt':18,47 'like':192 'low':157,165 'lower':189 'match':68 'model':129,198,213,230 'no-dep':91 'note':144 'numpi':81 'opencv':86 'opencv-python':85 'orchestr':5,34 'output':107,109,135,137 'path':66,105,123,130,133 'path/to/video':124 'pip':76,79,89 'pipelin':4,33 'prompt':25,54 'python':87,102,120,127 'pywavelet':83 'raw.githubusercontent.com':246 'raw.githubusercontent.com/stability-ai/generative-models/head/readme.md':245 'requir':97 'resolut':191 'rest':13,42 'run':118,125,146,161,228 'script':210 'scripts/sampling/simple_video_sample_4d2.py':103,121,128 'sdxl':6,35 'set':168 'setup':65 'singl':151 'size':194 'skill':250 'skill-stable-diffusion-xl-pipeline' 'sourc':237,248 'source-agentskillexchange' 'stabil':11,40 'stabl':1,30 'start':143 'style':21,50 'sv3d':224 'sv4d':147,226 'sv4d2.safetensors':112 'take':199,214 'time':177,187,206 'topic-agent-skills' 'topic-ai-agents' 'topic-ai-tools' 'topic-awesome-list' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-llm' 'topic-mcp' 'topic-npx-skills' 'topic-openclaw' 'topic-skills-catalog' 'transfer':22,51 'tri':167 'upstream':62,101,243 'usag':139 'use':27,56,60 'via':9,38 'video':153,190,217 'vram':158,166 'watermark':96 'xl':3,32","prices":[{"id":"62ede67e-d4d4-467f-8f01-db5945681ec8","listingId":"6f167702-b3d3-4876-acba-8a92d7990337","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"agentskillexchange","category":"skills","install_from":"skills.sh"},"createdAt":"2026-05-18T13:19:37.711Z"}],"sources":[{"listingId":"6f167702-b3d3-4876-acba-8a92d7990337","source":"github","sourceId":"agentskillexchange/skills/stable-diffusion-xl-pipeline","sourceUrl":"https://github.com/agentskillexchange/skills/tree/main/skills/stable-diffusion-xl-pipeline","isPrimary":false,"firstSeenAt":"2026-05-18T13:19:37.711Z","lastSeenAt":"2026-05-18T19:12:37.386Z"}],"details":{"listingId":"6f167702-b3d3-4876-acba-8a92d7990337","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"agentskillexchange","slug":"stable-diffusion-xl-pipeline","github":{"repo":"agentskillexchange/skills","stars":8,"topics":["agent-skills","ai-agents","ai-tools","awesome-list","claude-code","codex","cursor","llm","mcp","npx-skills","openclaw","skills-catalog"],"license":"mit","html_url":"https://github.com/agentskillexchange/skills","pushed_at":"2026-05-18T19:02:17Z","description":"The open catalog of AI agent skills — 2,000+ security-scanned skills for Claude Code, Cursor, Codex, and more.","skill_md_sha":"85646cd4a8596d77667d79fd1622601bfa471d2b","skill_md_path":"skills/stable-diffusion-xl-pipeline/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/agentskillexchange/skills/tree/main/skills/stable-diffusion-xl-pipeline"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"Stable Diffusion XL Pipeline","description":"Orchestrates SDXL image generation via the Stability AI REST API with ControlNet conditioning, IP-Adapter style transfer, and automatic prompt enhancement using CLIP interrogation."},"skills_sh_url":"https://skills.sh/agentskillexchange/skills/stable-diffusion-xl-pipeline"},"updatedAt":"2026-05-18T19:12:37.386Z"}}