{"id":"bc688a07-d105-446b-ad0b-9ecf2dfbf1b0","shortId":"6fLwah","kind":"skill","title":"azure-reliability","tagline":"Expert knowledge for Azure Reliability development including best practices, decision making, architecture & design patterns, limits & quotas, and deployment. Use when designing zone/multi-region apps, AZ-enabled MySQL, resilient Functions, AKS/DB HA, or Queue size limits, and ot","description":"# Azure Reliability Skill\n\nThis skill provides expert guidance for Azure Reliability. Covers best practices, decision making, architecture & design patterns, limits & quotas, and deployment. It combines local quick-reference content with remote documentation fetching capabilities.\n\n## How to Use This Skill\n\n> **IMPORTANT for Agent**: Use the **Category Index** below to locate relevant sections. For categories with line ranges (e.g., `L35-L120`), use `read_file` with the specified lines. For categories with file links (e.g., `[security.md](security.md)`), use `read_file` on the linked reference file\n\n> **IMPORTANT for Agent**: If `metadata.generated_at` is more than 3 months old, suggest the user pull the latest version from the repository. If `mcp_microsoftdocs` tools are not available, suggest the user install it: [Installation Guide](https://github.com/MicrosoftDocs/mcp/blob/main/README.md)\n\nThis skill requires **network access** to fetch documentation content:\n- **Preferred**: Use `mcp_microsoftdocs:microsoft_docs_fetch` with query string `from=learn-agent-skill`. Returns Markdown.\n- **Fallback**: Use `fetch_webpage` with query string `from=learn-agent-skill&accept=text/markdown`. Returns Markdown.\n\n## Category Index\n\n| Category | Lines | Description |\n|----------|-------|-------------|\n| Best Practices | L33-L68 | Patterns and checklists for designing, configuring, and hardening high‑availability, resilient architectures for specific Azure services (AKS, DBs, messaging, networking, monitoring, DR). |\n| Decision Making | L69-L73 | Guidance on using availability zones, nonregional services, and resilient Azure Functions architectures to design highly available, fault-tolerant Azure solutions. |\n| Architecture & Design Patterns | L74-L80 | Designing Azure apps for high availability using zones and multi-region patterns, including planning zone-resilient workloads, hardening zonal deployments, and building in nonpaired regions. |\n| Limits & Quotas | L81-L85 | Details on Azure Queue Storage message size limits, including max message size, behavior when limits are exceeded, and best practices for handling large payloads. |\n| Deployment | L86-L89 | Guidance on deploying Azure services and MySQL Flexible Server with availability zones, including configuring zone-redundant high availability and migration to zone-resilient setups. |\n\n### Best Practices\n| Topic | URL |\n|-------|-----|\n| Design resilient clusters in Azure Kubernetes Service | https://learn.microsoft.com/en-us/azure/reliability/reliability-aks |\n| Configure reliability for Azure API Center | https://learn.microsoft.com/en-us/azure/reliability/reliability-api-center |\n| Harden Azure App Service Environment reliability | https://learn.microsoft.com/en-us/azure/reliability/reliability-app-service-environment |\n| Architect highly available Azure Application Gateway v2 | https://learn.microsoft.com/en-us/azure/reliability/reliability-application-gateway-v2 |\n| Plan reliability for Azure Bot Service | https://learn.microsoft.com/en-us/azure/reliability/reliability-bot |\n| Configure reliability for Azure Chaos Studio | https://learn.microsoft.com/en-us/azure/reliability/reliability-chaos-studio |\n| Achieve high availability in Azure Cosmos DB NoSQL | https://learn.microsoft.com/en-us/azure/reliability/reliability-cosmos-db-nosql |\n| Design resilient Azure Data Explorer deployments | https://learn.microsoft.com/en-us/azure/reliability/reliability-data-explorer |\n| Harden Azure Data Factory for outages | https://learn.microsoft.com/en-us/azure/reliability/reliability-data-factory |\n| Harden Azure Database for MySQL for high availability | https://learn.microsoft.com/en-us/azure/reliability/reliability-database-mysql |\n| Design resilient Azure Database for MySQL deployments | https://learn.microsoft.com/en-us/azure/reliability/reliability-database-mysql |\n| Implement high availability for Azure Database for PostgreSQL | https://learn.microsoft.com/en-us/azure/reliability/reliability-database-postgresql |\n| Implement resilient architectures in Azure Databricks | https://learn.microsoft.com/en-us/azure/reliability/reliability-databricks |\n| Ensure reliability for Azure Device Registry metadata | https://learn.microsoft.com/en-us/azure/reliability/reliability-device-registry |\n| Design high availability for Azure DocumentDB | https://learn.microsoft.com/en-us/azure/reliability/reliability-documentdb |\n| Build resilient architectures with Azure Event Grid | https://learn.microsoft.com/en-us/azure/reliability/reliability-event-grid |\n| Increase reliability of Azure Event Hubs streaming | https://learn.microsoft.com/en-us/azure/reliability/reliability-event-hubs |\n| Design reliable analytics with Microsoft Fabric | https://learn.microsoft.com/en-us/azure/reliability/reliability-fabric |\n| Implement resilient architectures with Azure Functions | https://learn.microsoft.com/en-us/azure/reliability/reliability-functions |\n| Implement resilient architectures with Azure Functions | https://learn.microsoft.com/en-us/azure/reliability/reliability-functions |\n| Implement disaster recovery for Azure Image Builder | https://learn.microsoft.com/en-us/azure/reliability/reliability-image-builder |\n| Design resilient device connectivity with Azure IoT Hub | https://learn.microsoft.com/en-us/azure/reliability/reliability-iot-hub |\n| Design resilient architectures with Azure Load Balancer | https://learn.microsoft.com/en-us/azure/reliability/reliability-load-balancer |\n| Design resilient architectures with Azure Load Balancer | https://learn.microsoft.com/en-us/azure/reliability/reliability-load-balancer |\n| Design resilient workflows with Azure Logic Apps | https://learn.microsoft.com/en-us/azure/reliability/reliability-logic-apps |\n| Increase reliability of Azure Managed Redis caches | https://learn.microsoft.com/en-us/azure/reliability/reliability-managed-redis |\n| Implement resilient logging with Azure Monitor Logs | https://learn.microsoft.com/en-us/azure/reliability/reliability-monitor-logs |\n| Improve reliability of Azure Notification Hubs | https://learn.microsoft.com/en-us/azure/reliability/reliability-notification-hubs |\n| Design resilient disaster recovery with Azure Site Recovery | https://learn.microsoft.com/en-us/azure/reliability/reliability-site-recovery |\n| Implement resilient architectures in Azure SQL Database | https://learn.microsoft.com/en-us/azure/reliability/reliability-sql-database |\n| Increase reliability of Azure Stream Analytics jobs | https://learn.microsoft.com/en-us/azure/reliability/reliability-stream-analytics |\n| Design resilient workloads on Azure VMware Solution | https://learn.microsoft.com/en-us/azure/reliability/reliability-vmware-solution |\n\n### Decision Making\n| Topic | URL |\n|-------|-----|\n| Select and understand Azure nonregional services | https://learn.microsoft.com/en-us/azure/reliability/regions-nonregional-services |\n\n### Architecture & Design Patterns\n| Topic | URL |\n|-------|-----|\n| Enable and plan zone-resilient Azure workloads | https://learn.microsoft.com/en-us/azure/reliability/availability-zones-enable-zone-resiliency |\n| Design and harden zonal Azure resource deployments | https://learn.microsoft.com/en-us/azure/reliability/availability-zones-zonal-resource-resiliency |\n| Design multi-region solutions in nonpaired Azure regions | https://learn.microsoft.com/en-us/azure/reliability/regions-multi-region-nonpaired |\n\n### Limits & Quotas\n| Topic | URL |\n|-------|-----|\n| Understand Azure Queue Storage message size limits | https://learn.microsoft.com/en-us/azure/reliability/reliability-storage-queue |\n\n### Deployment\n| Topic | URL |\n|-------|-----|\n| Use Azure services with availability zone support | https://learn.microsoft.com/en-us/azure/reliability/availability-zones-service-support |","tags":["azure","reliability","agent","skills","microsoftdocs","agent-skills","agentic-skills","agentskill","ai-agents","ai-coding","azure-functions","azure-kubernetes-service"],"capabilities":["skill","source-microsoftdocs","skill-azure-reliability","topic-agent","topic-agent-skills","topic-agentic-skills","topic-agentskill","topic-ai-agents","topic-ai-coding","topic-azure","topic-azure-functions","topic-azure-kubernetes-service","topic-azure-openai","topic-azure-sql-database","topic-azure-storage"],"categories":["Agent-Skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/MicrosoftDocs/Agent-Skills/azure-reliability","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add MicrosoftDocs/Agent-Skills","source_repo":"https://github.com/MicrosoftDocs/Agent-Skills","install_from":"skills.sh"}},"qualityScore":"0.698","qualityRationale":"deterministic score 0.70 from registry signals: · indexed on github topic:agent-skills · 497 github stars · SKILL.md body (7,816 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-04-22T00:53:36.439Z","embedding":null,"createdAt":"2026-04-18T21:59:56.473Z","updatedAt":"2026-04-22T00:53:36.439Z","lastSeenAt":"2026-04-22T00:53:36.439Z","tsv":"'/en-us/azure/reliability/availability-zones-enable-zone-resiliency':701 '/en-us/azure/reliability/availability-zones-service-support':750 '/en-us/azure/reliability/availability-zones-zonal-resource-resiliency':711 '/en-us/azure/reliability/regions-multi-region-nonpaired':723 '/en-us/azure/reliability/regions-nonregional-services':685 '/en-us/azure/reliability/reliability-aks':369 '/en-us/azure/reliability/reliability-api-center':378 '/en-us/azure/reliability/reliability-app-service-environment':387 '/en-us/azure/reliability/reliability-application-gateway-v2':397 '/en-us/azure/reliability/reliability-bot':406 '/en-us/azure/reliability/reliability-chaos-studio':415 '/en-us/azure/reliability/reliability-cosmos-db-nosql':426 '/en-us/azure/reliability/reliability-data-explorer':435 '/en-us/azure/reliability/reliability-data-factory':444 '/en-us/azure/reliability/reliability-database-mysql':455,465 '/en-us/azure/reliability/reliability-database-postgresql':476 '/en-us/azure/reliability/reliability-databricks':485 '/en-us/azure/reliability/reliability-device-registry':495 '/en-us/azure/reliability/reliability-documentdb':504 '/en-us/azure/reliability/reliability-event-grid':514 '/en-us/azure/reliability/reliability-event-hubs':524 '/en-us/azure/reliability/reliability-fabric':533 '/en-us/azure/reliability/reliability-functions':542,551 '/en-us/azure/reliability/reliability-image-builder':561 '/en-us/azure/reliability/reliability-iot-hub':572 '/en-us/azure/reliability/reliability-load-balancer':582,592 '/en-us/azure/reliability/reliability-logic-apps':602 '/en-us/azure/reliability/reliability-managed-redis':612 '/en-us/azure/reliability/reliability-monitor-logs':622 '/en-us/azure/reliability/reliability-notification-hubs':631 '/en-us/azure/reliability/reliability-site-recovery':642 '/en-us/azure/reliability/reliability-sql-database':652 '/en-us/azure/reliability/reliability-storage-queue':737 '/en-us/azure/reliability/reliability-stream-analytics':662 '/en-us/azure/reliability/reliability-vmware-solution':672 '/microsoftdocs/mcp/blob/main/readme.md)':163 '3':134 'accept':202 'access':168 'achiev':416 'agent':83,127,186,200 'ak':232 'aks/db':33 'analyt':527,658 'api':374 'app':26,272,381,599 'applic':392 'architect':388 'architectur':15,57,227,254,264,479,507,536,545,575,585,645,686 'avail':153,225,246,258,275,340,348,390,418,452,468,498,745 'az':28 'az-en':27 'azur':2,7,41,50,230,252,262,271,304,333,364,373,380,391,401,410,420,429,437,446,458,470,481,489,500,509,518,538,547,556,567,577,587,597,606,617,626,637,647,656,667,680,697,706,719,729,742 'azure-reli':1 'balanc':579,589 'behavior':314 'best':11,53,211,320,356 'bot':402 'build':293,505 'builder':558 'cach':609 'capabl':75 'categori':86,94,110,206,208 'center':375 'chao':411 'checklist':218 'cluster':362 'combin':65 'configur':221,343,370,407 'connect':565 'content':70,172 'cosmos':421 'cover':52 'data':430,438 'databas':447,459,471,649 'databrick':482 'db':422 'dbs':233 'decis':13,55,238,673 'deploy':21,63,291,326,332,432,462,708,738 'descript':210 'design':16,24,58,220,256,265,270,360,427,456,496,525,562,573,583,593,632,663,687,702,712 'detail':302 'develop':9 'devic':490,564 'disast':553,634 'doc':178 'document':73,171 'documentdb':501 'dr':237 'e.g':98,114 'enabl':29,691 'ensur':486 'environ':383 'event':510,519 'exceed':318 'expert':4,47 'explor':431 'fabric':530 'factori':439 'fallback':190 'fault':260 'fault-toler':259 'fetch':74,170,179,192 'file':104,112,119,124 'flexibl':337 'function':32,253,539,548 'gateway':393 'github.com':162 'github.com/microsoftdocs/mcp/blob/main/readme.md)':161 'grid':511 'guid':160 'guidanc':48,243,330 'ha':34 'handl':323 'harden':223,289,379,436,445,704 'high':224,257,274,347,389,417,451,467,497 'hub':520,569,628 'imag':557 'implement':466,477,534,543,552,613,643 'import':81,125 'improv':623 'includ':10,283,310,342 'increas':515,603,653 'index':87,207 'instal':157,159 'iot':568 'job':659 'knowledg':5 'kubernet':365 'l120':101 'l33':214 'l33-l68':213 'l35':100 'l35-l120':99 'l68':215 'l69':241 'l69-l73':240 'l73':242 'l74':268 'l74-l80':267 'l80':269 'l81':300 'l81-l85':299 'l85':301 'l86':328 'l86-l89':327 'l89':329 'larg':324 'latest':142 'learn':185,199 'learn-agent-skil':184,198 'learn.microsoft.com':368,377,386,396,405,414,425,434,443,454,464,475,484,494,503,513,523,532,541,550,560,571,581,591,601,611,621,630,641,651,661,671,684,700,710,722,736,749 'learn.microsoft.com/en-us/azure/reliability/availability-zones-enable-zone-resiliency':699 'learn.microsoft.com/en-us/azure/reliability/availability-zones-service-support':748 'learn.microsoft.com/en-us/azure/reliability/availability-zones-zonal-resource-resiliency':709 'learn.microsoft.com/en-us/azure/reliability/regions-multi-region-nonpaired':721 'learn.microsoft.com/en-us/azure/reliability/regions-nonregional-services':683 'learn.microsoft.com/en-us/azure/reliability/reliability-aks':367 'learn.microsoft.com/en-us/azure/reliability/reliability-api-center':376 'learn.microsoft.com/en-us/azure/reliability/reliability-app-service-environment':385 'learn.microsoft.com/en-us/azure/reliability/reliability-application-gateway-v2':395 'learn.microsoft.com/en-us/azure/reliability/reliability-bot':404 'learn.microsoft.com/en-us/azure/reliability/reliability-chaos-studio':413 'learn.microsoft.com/en-us/azure/reliability/reliability-cosmos-db-nosql':424 'learn.microsoft.com/en-us/azure/reliability/reliability-data-explorer':433 'learn.microsoft.com/en-us/azure/reliability/reliability-data-factory':442 'learn.microsoft.com/en-us/azure/reliability/reliability-database-mysql':453,463 'learn.microsoft.com/en-us/azure/reliability/reliability-database-postgresql':474 'learn.microsoft.com/en-us/azure/reliability/reliability-databricks':483 'learn.microsoft.com/en-us/azure/reliability/reliability-device-registry':493 'learn.microsoft.com/en-us/azure/reliability/reliability-documentdb':502 'learn.microsoft.com/en-us/azure/reliability/reliability-event-grid':512 'learn.microsoft.com/en-us/azure/reliability/reliability-event-hubs':522 'learn.microsoft.com/en-us/azure/reliability/reliability-fabric':531 'learn.microsoft.com/en-us/azure/reliability/reliability-functions':540,549 'learn.microsoft.com/en-us/azure/reliability/reliability-image-builder':559 'learn.microsoft.com/en-us/azure/reliability/reliability-iot-hub':570 'learn.microsoft.com/en-us/azure/reliability/reliability-load-balancer':580,590 'learn.microsoft.com/en-us/azure/reliability/reliability-logic-apps':600 'learn.microsoft.com/en-us/azure/reliability/reliability-managed-redis':610 'learn.microsoft.com/en-us/azure/reliability/reliability-monitor-logs':620 'learn.microsoft.com/en-us/azure/reliability/reliability-notification-hubs':629 'learn.microsoft.com/en-us/azure/reliability/reliability-site-recovery':640 'learn.microsoft.com/en-us/azure/reliability/reliability-sql-database':650 'learn.microsoft.com/en-us/azure/reliability/reliability-storage-queue':735 'learn.microsoft.com/en-us/azure/reliability/reliability-stream-analytics':660 'learn.microsoft.com/en-us/azure/reliability/reliability-vmware-solution':670 'limit':18,38,60,297,309,316,724,734 'line':96,108,209 'link':113,122 'load':578,588 'local':66 'locat':90 'log':615,619 'logic':598 'make':14,56,239,674 'manag':607 'markdown':189,205 'max':311 'mcp':148,175 'messag':234,307,312,732 'metadata':492 'metadata.generated':129 'microsoft':177,529 'microsoftdoc':149,176 'migrat':350 'monitor':236,618 'month':135 'multi':280,714 'multi-region':279,713 'mysql':30,336,449,461 'network':167,235 'nonpair':295,718 'nonregion':248,681 'nosql':423 'notif':627 'old':136 'ot':40 'outag':441 'pattern':17,59,216,266,282,688 'payload':325 'plan':284,398,693 'postgresql':473 'practic':12,54,212,321,357 'prefer':173 'provid':46 'pull':140 'queri':181,195 'queue':36,305,730 'quick':68 'quick-refer':67 'quota':19,61,298,725 'rang':97 'read':103,118 'recoveri':554,635,639 'redi':608 'redund':346 'refer':69,123 'region':281,296,715,720 'registri':491 'relev':91 'reliabl':3,8,42,51,371,384,399,408,487,516,526,604,624,654 'remot':72 'repositori':146 'requir':166 'resili':31,226,251,287,354,361,428,457,478,506,535,544,563,574,584,594,614,633,644,664,696 'resourc':707 'return':188,204 'section':92 'security.md':115,116 'select':677 'server':338 'servic':231,249,334,366,382,403,682,743 'setup':355 'site':638 'size':37,308,313,733 'skill':43,45,80,165,187,201 'skill-azure-reliability' 'solut':263,669,716 'source-microsoftdocs' 'specif':229 'specifi':107 'sql':648 'storag':306,731 'stream':521,657 'string':182,196 'studio':412 'suggest':137,154 'support':747 'text/markdown':203 'toler':261 'tool':150 'topic':358,675,689,726,739 'topic-agent' 'topic-agent-skills' 'topic-agentic-skills' 'topic-agentskill' 'topic-ai-agents' 'topic-ai-coding' 'topic-azure' 'topic-azure-functions' 'topic-azure-kubernetes-service' 'topic-azure-openai' 'topic-azure-sql-database' 'topic-azure-storage' 'understand':679,728 'url':359,676,690,727,740 'use':22,78,84,102,117,174,191,245,276,741 'user':139,156 'v2':394 'version':143 'vmware':668 'webpag':193 'workflow':595 'workload':288,665,698 'zonal':290,705 'zone':247,277,286,341,345,353,695,746 'zone-redund':344 'zone-resili':285,352,694 'zone/multi-region':25","prices":[{"id":"b84cad71-5f15-4fcd-be75-7c8187b53495","listingId":"bc688a07-d105-446b-ad0b-9ecf2dfbf1b0","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"MicrosoftDocs","category":"Agent-Skills","install_from":"skills.sh"},"createdAt":"2026-04-18T21:59:56.473Z"}],"sources":[{"listingId":"bc688a07-d105-446b-ad0b-9ecf2dfbf1b0","source":"github","sourceId":"MicrosoftDocs/Agent-Skills/azure-reliability","sourceUrl":"https://github.com/MicrosoftDocs/Agent-Skills/tree/main/skills/azure-reliability","isPrimary":false,"firstSeenAt":"2026-04-18T21:59:56.473Z","lastSeenAt":"2026-04-22T00:53:36.439Z"}],"details":{"listingId":"bc688a07-d105-446b-ad0b-9ecf2dfbf1b0","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"MicrosoftDocs","slug":"azure-reliability","github":{"repo":"MicrosoftDocs/Agent-Skills","stars":497,"topics":["agent","agent-skills","agentic-skills","agentskill","ai","ai-agents","ai-coding","azure","azure-functions","azure-kubernetes-service","azure-openai","azure-sql-database","azure-storage","azure-virtual-machine","claude-code","github-copilot","microsoft-learn","openai-codex","skills"],"license":"cc-by-4.0","html_url":"https://github.com/MicrosoftDocs/Agent-Skills","pushed_at":"2026-04-19T02:43:40Z","description":"Curated Agent Skills for Microsoft & Azure – giving AI coding assistants structured, real-time expertise from Microsoft Learn docs.","skill_md_sha":"1b1f9923bb182efa94361814568fecf63499a5d3","skill_md_path":"skills/azure-reliability/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/MicrosoftDocs/Agent-Skills/tree/main/skills/azure-reliability"},"layout":"multi","source":"github","category":"Agent-Skills","frontmatter":{"name":"azure-reliability","description":"Expert knowledge for Azure Reliability development including best practices, decision making, architecture & design patterns, limits & quotas, and deployment. Use when designing zone/multi-region apps, AZ-enabled MySQL, resilient Functions, AKS/DB HA, or Queue size limits, and other Azure Reliability related development tasks. Not for Azure Resiliency (use azure-resiliency), Azure Monitor (use azure-monitor), Azure Service Health (use azure-service-health), Chaos Studio (use azure-chaos-studio).","compatibility":"Requires network access. Uses mcp_microsoftdocs:microsoft_docs_fetch or fetch_webpage to retrieve documentation."},"skills_sh_url":"https://skills.sh/MicrosoftDocs/Agent-Skills/azure-reliability"},"updatedAt":"2026-04-22T00:53:36.439Z"}}