{"id":"5607bf0b-86fd-4ff3-89e9-e84596cf07c5","shortId":"HmCfFG","kind":"skill","title":"qdrant-deployment-options","tagline":"Guides Qdrant deployment selection. Use when someone asks 'how to deploy Qdrant', 'Docker vs Cloud', 'local mode', 'embedded Qdrant', 'Qdrant EDGE', 'which deployment option', 'self-hosted vs cloud', or 'need lowest latency deployment'. Also use when choosing between deployment t","description":"# Which Qdrant Deployment Do I Need?\n\nStart with what you need: managed ops or full control? Network latency acceptable or not? Production or prototyping? The answer narrows to one of four options.\n\n\n## Getting Started or Prototyping\n\nUse when: building a prototype, running tests, CI/CD pipelines, or learning Qdrant.\n\n- Use local mode (Python only): zero-dependency, in-memory or disk-persisted, no server needed [Local mode](https://search.qdrant.tech/md/documentation/quickstart/)\n- Local mode data format is NOT compatible with server. Do not use for production or benchmarking.\n- For a real server locally, use Docker [Quick start](https://search.qdrant.tech/md/documentation/quickstart/?s=download-and-run)\n\n\n## Going to Production (Self-Hosted)\n\nUse when: you need full control over infrastructure, data residency, or custom configuration.\n\n- Docker is the default deployment. Full Qdrant Open Source feature set, minimal setup. [Quick start](https://search.qdrant.tech/md/documentation/quickstart/?s=download-and-run)\n- You own operations: upgrades, backups, scaling, monitoring\n- Must set up distributed mode manually for multi-node clusters [Distributed deployment](https://search.qdrant.tech/md/documentation/distributed_deployment/)\n- Consider Hybrid Cloud if you want Qdrant Cloud management on your infrastructure [Hybrid Cloud](https://search.qdrant.tech/md/documentation/hybrid-cloud/)\n\n\n## Going to Production (Zero-Ops)\n\nUse when: you want managed infrastructure with zero-downtime updates, automatic backups, and resharding without operating clusters yourself.\n\n- Qdrant Cloud handles upgrades, scaling, backups, and monitoring [Qdrant Cloud](https://search.qdrant.tech/md/documentation/cloud-quickstart/)\n- Supports multi-version upgrades automatically\n- Provides features not available in self-hosted: `/sys_metrics`, managed resharding, pre-configured alerts\n\n\n## Need Lowest Possible Latency\n\nUse when: network round-trip to a server is unacceptable. Edge devices, in-process search, or latency-critical applications.\n\n- Qdrant EDGE: in-process bindings to Qdrant shard-level functions, no network overhead [Qdrant EDGE](https://search.qdrant.tech/md/documentation/edge/edge-quickstart/)\n- Same data format as server. Can sync with server via shard snapshots.\n- Single-node feature set only. No distributed mode.\n\n\n## What NOT to Do\n\n- Use local mode for production or benchmarking (not optimized, incompatible data format)\n- Self-host without monitoring and backup strategy (you will lose data or miss outages)\n- Choose EDGE when you need distributed search (single-node only)\n- Pick Hybrid Cloud unless you have data residency requirements (unnecessary Kubernetes complexity when Qdrant Cloud works)","tags":["qdrant","deployment","options","skills","agent-skills","ai-agents","claude-code","codex","cursor","embeddings","hybrid-search","monitoring"],"capabilities":["skill","source-qdrant","skill-qdrant-deployment-options","topic-agent-skills","topic-ai-agents","topic-claude-code","topic-codex","topic-cursor","topic-embeddings","topic-hybrid-search","topic-monitoring","topic-multitenancy","topic-performance","topic-qdrant","topic-quantization"],"categories":["skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/qdrant/skills/qdrant-deployment-options","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add qdrant/skills","source_repo":"https://github.com/qdrant/skills","install_from":"skills.sh"}},"qualityScore":"0.489","qualityRationale":"deterministic score 0.49 from registry signals: · indexed on github topic:agent-skills · 79 github stars · SKILL.md body (2,725 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-04-22T06:55:38.629Z","embedding":null,"createdAt":"2026-04-18T22:12:58.699Z","updatedAt":"2026-04-22T06:55:38.629Z","lastSeenAt":"2026-04-22T06:55:38.629Z","tsv":"'/md/documentation/cloud-quickstart/)':259 '/md/documentation/distributed_deployment/)':204 '/md/documentation/edge/edge-quickstart/)':326 '/md/documentation/hybrid-cloud/)':221 '/md/documentation/quickstart/)':116 '/md/documentation/quickstart/?s=download-and-run)':144,181 '/sys_metrics':274 'accept':64 'alert':280 'also':39 'answer':71 'applic':306 'ask':12 'automat':239,265 'avail':269 'backup':186,240,252,370 'benchmark':132,358 'bind':312 'build':84 'choos':42,379 'ci/cd':89 'cloud':19,33,207,212,218,248,256,392,404 'cluster':199,245 'compat':123 'complex':401 'configur':163,279 'consid':205 'control':61,156 'critic':305 'custom':162 'data':119,159,328,362,375,396 'default':167 'depend':101 'deploy':3,7,15,27,38,44,48,168,201 'devic':297 'disk':107 'disk-persist':106 'distribut':192,200,346,384 'docker':17,139,164 'downtim':237 'edg':25,296,308,323,380 'embed':22 'featur':173,267,342 'format':120,329,363 'four':76 'full':60,155,169 'function':318 'get':78 'go':145,222 'guid':5 'handl':249 'host':31,150,273,366 'hybrid':206,217,391 'in-memori':102 'in-process':298,309 'incompat':361 'infrastructur':158,216,233 'kubernet':400 'latenc':37,63,284,304 'latency-crit':303 'learn':92 'level':317 'local':20,95,112,117,137,353 'lose':374 'lowest':36,282 'manag':57,213,232,275 'manual':194 'memori':104 'minim':175 'miss':377 'mode':21,96,113,118,193,347,354 'monitor':188,254,368 'multi':197,262 'multi-nod':196 'multi-vers':261 'must':189 'narrow':72 'need':35,51,56,111,154,281,383 'network':62,287,320 'node':198,341,388 'one':74 'op':58,227 'open':171 'oper':184,244 'optim':360 'option':4,28,77 'outag':378 'overhead':321 'persist':108 'pick':390 'pipelin':90 'possibl':283 'pre':278 'pre-configur':277 'process':300,311 'product':67,130,147,224,356 'prototyp':69,81,86 'provid':266 'python':97 'qdrant':2,6,16,23,24,47,93,170,211,247,255,307,314,322,403 'qdrant-deployment-opt':1 'quick':140,177 'real':135 'requir':398 'reshard':242,276 'resid':160,397 'round':289 'round-trip':288 'run':87 'scale':187,251 'search':301,385 'search.qdrant.tech':115,143,180,203,220,258,325 'search.qdrant.tech/md/documentation/cloud-quickstart/)':257 'search.qdrant.tech/md/documentation/distributed_deployment/)':202 'search.qdrant.tech/md/documentation/edge/edge-quickstart/)':324 'search.qdrant.tech/md/documentation/hybrid-cloud/)':219 'search.qdrant.tech/md/documentation/quickstart/)':114 'search.qdrant.tech/md/documentation/quickstart/?s=download-and-run)':142,179 'select':8 'self':30,149,272,365 'self-host':29,148,271,364 'server':110,125,136,293,331,335 'set':174,190,343 'setup':176 'shard':316,337 'shard-level':315 'singl':340,387 'single-nod':339,386 'skill' 'skill-qdrant-deployment-options' 'snapshot':338 'someon':11 'sourc':172 'source-qdrant' 'start':52,79,141,178 'strategi':371 'support':260 'sync':333 'test':88 'topic-agent-skills' 'topic-ai-agents' 'topic-claude-code' 'topic-codex' 'topic-cursor' 'topic-embeddings' 'topic-hybrid-search' 'topic-monitoring' 'topic-multitenancy' 'topic-performance' 'topic-qdrant' 'topic-quantization' 'trip':290 'unaccept':295 'unless':393 'unnecessari':399 'updat':238 'upgrad':185,250,264 'use':9,40,82,94,128,138,151,228,285,352 'version':263 'via':336 'vs':18,32 'want':210,231 'without':243,367 'work':405 'zero':100,226,236 'zero-depend':99 'zero-downtim':235 'zero-op':225","prices":[{"id":"e8fc4b44-5ad4-4d5b-989f-0abccac0a3e3","listingId":"5607bf0b-86fd-4ff3-89e9-e84596cf07c5","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"qdrant","category":"skills","install_from":"skills.sh"},"createdAt":"2026-04-18T22:12:58.699Z"}],"sources":[{"listingId":"5607bf0b-86fd-4ff3-89e9-e84596cf07c5","source":"github","sourceId":"qdrant/skills/qdrant-deployment-options","sourceUrl":"https://github.com/qdrant/skills/tree/main/skills/qdrant-deployment-options","isPrimary":false,"firstSeenAt":"2026-04-18T22:12:58.699Z","lastSeenAt":"2026-04-22T06:55:38.629Z"}],"details":{"listingId":"5607bf0b-86fd-4ff3-89e9-e84596cf07c5","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"qdrant","slug":"qdrant-deployment-options","github":{"repo":"qdrant/skills","stars":79,"topics":["agent-skills","ai-agents","claude-code","codex","cursor","embeddings","hybrid-search","monitoring","multitenancy","performance","qdrant","quantization","scaling","search-quality","vector-database","vector-search"],"license":"apache-2.0","html_url":"https://github.com/qdrant/skills","pushed_at":"2026-04-20T17:42:39Z","description":"Agent skills for Qdrant vector search: scaling, performance optimization, search quality, monitoring, deployment, model migration, version upgrades, and SDK usage across Python, TypeScript, Rust, Go, .NET, Java","skill_md_sha":"c894f8df6658ca04559b066a26d6fd368b920d8d","skill_md_path":"skills/qdrant-deployment-options/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/qdrant/skills/tree/main/skills/qdrant-deployment-options"},"layout":"multi","source":"github","category":"skills","frontmatter":{"name":"qdrant-deployment-options","description":"Guides Qdrant deployment selection. Use when someone asks 'how to deploy Qdrant', 'Docker vs Cloud', 'local mode', 'embedded Qdrant', 'Qdrant EDGE', 'which deployment option', 'self-hosted vs cloud', or 'need lowest latency deployment'. Also use when choosing between deployment types for a new project."},"skills_sh_url":"https://skills.sh/qdrant/skills/qdrant-deployment-options"},"updatedAt":"2026-04-22T06:55:38.629Z"}}