{"id":"2ba9c2d6-d2e2-4d53-a08d-2440049c493f","shortId":"QagXum","kind":"skill","title":"manage-slos","tagline":">-","description":"# Manage SLOs / SRM\n\n> **Limitation:** The MCP server does not currently expose `slo`, `slo_alert`, or\n> `monitored_service` resource types. SLO definitions, burn-rate alerts, and monitored-service\n> configuration must be created and edited via the Harness UI under\n> **Service Reliability Management**. This skill covers the parts of the SRM workflow that\n> *are* supported via MCP: deployment correlation, on-call handover reports, and operational\n> runbooks.\n\n## What this skill can do via MCP\n\n| Workflow | Supported today |\n|---|---|\n| Define an SLO or SLI | ❌ Use the Harness UI |\n| Configure error-budget / burn-rate alerts | ❌ Use the Harness UI |\n| Configure a monitored service | ❌ Use the Harness UI |\n| Correlate deployments with an incident | ✅ via `execution` |\n| Summarize recent releases for on-call handover | ✅ via `execution`, `service`, `environment` |\n| Draft an operational runbook | ✅ (LLM-authored; pulls context from MCP) |\n\n## Instructions\n\n### Step 1: Establish Scope\n\n```\nCall MCP tool: harness_list\nParameters:\n  resource_type: \"project\"\n  org_id: \"<organization>\"\n```\n\n### Step 2: Incident Triage — Correlate Deployments\n\nWhen the user reports an active incident:\n\n1. Identify the affected service and environment.\n2. Pull recent executions that deployed the service.\n\n```\nCall MCP tool: harness_list\nParameters:\n  resource_type: \"execution\"\n  org_id: \"<organization>\"\n  project_id: \"<project>\"\n  # filter by service or environment as needed\n```\n\n3. Correlate incident start time with deployment timestamps.\n4. Pull the failing execution's details:\n\n```\nCall MCP tool: harness_get\nParameters:\n  resource_type: \"execution\"\n  resource_id: \"<execution_id>\"\n  org_id: \"<organization>\"\n  project_id: \"<project>\"\n```\n\n5. Guide the user through structured RCA: blast radius, suspected root cause, mitigation steps, rollback candidate.\n\n### Step 3: On-Call Handover Report\n\nGather from the user: outgoing/incoming engineers, shift window, owned services.\n\nPull recent executions and services:\n\n```\nCall MCP tool: harness_list\nParameters:\n  resource_type: \"execution\"\n  org_id: \"<organization>\"\n  project_id: \"<project>\"\n```\n\n```\nCall MCP tool: harness_list\nParameters:\n  resource_type: \"service\"\n  org_id: \"<organization>\"\n  project_id: \"<project>\"\n```\n\nGenerate a structured handover covering: active/recent incidents the user describes,\ndeployments during the shift, services with elevated failure counts, and items\nrequiring attention.\n\n> For SLO burn-rate and error-budget status, direct the user to the SRM UI —\n> the MCP server does not expose these metrics.\n\n### Step 4: Operational Runbook (LLM-Authored)\n\nGather from the user: service name, team, tech stack, dependencies, SLO targets, common\nfailure modes.\n\nStructure the runbook with:\n- Service overview (purpose, owners, tech stack)\n- Health checks (pointers to SRM monitored-service dashboard in the Harness UI)\n- Common alerts with response procedures\n- Escalation paths\n- Rollback procedures (reference relevant pipelines via `harness_list resource_type: pipeline`)\n- Dependency contacts\n\n## Defining SLOs (UI-Only Today)\n\nWhen the user asks to define an SLO, burn-rate alert, or monitored service, respond:\n\n1. Gather requirements (service tier, health sources, SLO targets, rolling window, SLI type).\n2. Explain that SLO CRUD is not exposed via MCP today and link the user to the Harness SRM UI.\n3. Offer to draft the SLO spec (name, target %, SLI type, burn-rate thresholds) as text the\n   user can paste into the UI.\n4. Suggested burn-rate alert windows: 14.4×/1h (page), 6×/6h (ticket), 1×/3d (log).\n\n## Examples\n\n- \"Our payment service is down, help me triage\" — Pull recent `execution`s for the service, correlate with incident start, suggest rollback candidate.\n- \"Generate an on-call handover report\" — Pull executions and services during the shift, summarize with active issues.\n- \"Create a runbook for the auth-service\" — Draft runbook using MCP to list pipelines/services/environments for accurate references.\n- \"Define SLOs for our payment-gateway service\" — Draft the SLO spec as text; point to the Harness SRM UI for creation.\n- \"Configure burn-rate alerts\" — Draft the alert config; point to the SRM UI.\n\n## Performance Notes\n\n- When correlating incidents with deployments, pull a wide enough execution window (±30 min) to catch slow-burn failures.\n- For handover reports, include both successful and failed executions — a streak of successes is useful context.\n\n## Troubleshooting\n\n### \"SLO not found\" or \"Monitored service not found\"\n- These resources are not exposed by the MCP server today. Manage them in the Harness UI under Service Reliability Management.\n\n### Incident correlation missing executions\n- Confirm `org_id` and `project_id` scope the service\n- Broaden the execution filter time window\n- Check that the service's pipelines actually ran (no deploy = no execution)","tags":["manage","slos","harness","skills","agent-skills","agents"],"capabilities":["skill","source-harness","skill-manage-slos","topic-agent-skills","topic-agents"],"categories":["harness-skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/harness/harness-skills/manage-slos","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add harness/harness-skills","source_repo":"https://github.com/harness/harness-skills","install_from":"skills.sh"}},"qualityScore":"0.457","qualityRationale":"deterministic score 0.46 from registry signals: · indexed on github topic:agent-skills · 15 github stars · SKILL.md body (4,884 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T19:06:30.822Z","embedding":null,"createdAt":"2026-05-09T01:05:29.470Z","updatedAt":"2026-05-18T19:06:30.822Z","lastSeenAt":"2026-05-18T19:06:30.822Z","tsv":"'/1h':498 '/3d':504 '/6h':501 '1':142,169,433,503 '14.4':497 '2':157,176,446 '3':204,251,466 '30':614 '4':212,347,490 '5':234 '6':500 'accur':563 'activ':167,545 'active/recent':303 'actual':692 'affect':172 'alert':17,28,97,392,428,495,591,594 'ask':420 'attent':320 'auth':553 'auth-servic':552 'author':135,352 'blast':241 'broaden':680 'budget':93,329 'burn':26,95,324,426,478,493,589,620 'burn-rat':25,94,323,425,477,492,588 'call':65,123,145,184,219,254,272,285,533 'candid':249,528 'catch':617 'caus':245 'check':379,686 'common':365,391 'config':595 'configur':33,90,102,587 'confirm':671 'contact':410 'context':137,637 'correl':62,110,160,205,522,604,668 'count':316 'cover':49,302 'creat':36,547 'creation':586 'crud':450 'current':13 'dashboard':386 'defin':81,411,422,565 'definit':24 'depend':362,409 'deploy':61,111,161,181,210,308,607,695 'describ':307 'detail':218 'direct':331 'draft':129,469,555,573,592 'edit':38 'elev':314 'engin':262 'enough':611 'environ':128,175,201 'error':92,328 'error-budget':91,327 'escal':396 'establish':143 'exampl':506 'execut':116,126,179,192,216,227,269,280,517,537,612,630,670,682,697 'explain':447 'expos':14,343,453,651 'fail':215,629 'failur':315,366,621 'filter':197,683 'found':641,646 'gateway':571 'gather':257,353,434 'generat':298,529 'get':223 'guid':235 'handov':66,124,255,301,534,623 'har':41,88,100,108,148,187,222,275,288,389,404,463,582,661 'health':378,438 'help':512 'id':155,194,196,229,231,233,282,284,295,297,673,676 'identifi':170 'incid':114,158,168,206,304,524,605,667 'includ':625 'instruct':140 'issu':546 'item':318 'limit':7 'link':458 'list':149,188,276,289,405,560 'llm':134,351 'llm-author':133,350 'log':505 'manag':2,4,46,657,666 'manage-slo':1 'mcp':9,60,77,139,146,185,220,273,286,339,455,558,654 'metric':345 'min':615 'miss':669 'mitig':246 'mode':367 'monitor':19,31,104,384,430,643 'monitored-servic':30,383 'must':34 'name':358,473 'need':203 'note':602 'offer':467 'on-cal':63,121,252,531 'oper':69,131,348 'org':154,193,230,281,294,672 'outgoing/incoming':261 'overview':373 'own':265 'owner':375 'page':499 'paramet':150,189,224,277,290 'part':51 'past':486 'path':397 'payment':508,570 'payment-gateway':569 'perform':601 'pipelin':402,408,691 'pipelines/services/environments':561 'point':579,596 'pointer':380 'procedur':395,399 'project':153,195,232,283,296,675 'pull':136,177,213,267,515,536,608 'purpos':374 'radius':242 'ran':693 'rate':27,96,325,427,479,494,590 'rca':240 'recent':118,178,268,516 'refer':400,564 'releas':119 'relev':401 'reliabl':45,665 'report':67,165,256,535,624 'requir':319,435 'resourc':21,151,190,225,228,278,291,406,648 'respond':432 'respons':394 'roll':442 'rollback':248,398,527 'root':244 'runbook':70,132,349,370,549,556 'scope':144,677 'server':10,340,655 'servic':20,32,44,105,127,173,183,199,266,271,293,312,357,372,385,431,436,509,521,539,554,572,644,664,679,689 'shift':263,311,542 'skill':48,73 'skill-manage-slos' 'sli':85,444,475 'slo':15,16,23,83,322,363,424,440,449,471,575,639 'slos':3,5,412,566 'slow':619 'slow-burn':618 'sourc':439 'source-harness' 'spec':472,576 'srm':6,54,336,382,464,583,599 'stack':361,377 'start':207,525 'status':330 'step':141,156,247,250,346 'streak':632 'structur':239,300,368 'success':627,634 'suggest':491,526 'summar':117,543 'support':58,79 'suspect':243 'target':364,441,474 'team':359 'tech':360,376 'text':482,578 'threshold':480 'ticket':502 'tier':437 'time':208,684 'timestamp':211 'today':80,416,456,656 'tool':147,186,221,274,287 'topic-agent-skills' 'topic-agents' 'triag':159,514 'troubleshoot':638 'type':22,152,191,226,279,292,407,445,476 'ui':42,89,101,109,337,390,414,465,489,584,600,662 'ui-on':413 'use':86,98,106,557,636 'user':164,237,260,306,333,356,419,460,484 'via':39,59,76,115,125,403,454 'wide':610 'window':264,443,496,613,685 'workflow':55,78","prices":[{"id":"633f5082-456e-49b3-a675-5fa946ccc0a5","listingId":"2ba9c2d6-d2e2-4d53-a08d-2440049c493f","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"harness","category":"harness-skills","install_from":"skills.sh"},"createdAt":"2026-05-09T01:05:29.470Z"}],"sources":[{"listingId":"2ba9c2d6-d2e2-4d53-a08d-2440049c493f","source":"github","sourceId":"harness/harness-skills/manage-slos","sourceUrl":"https://github.com/harness/harness-skills/tree/main/skills/manage-slos","isPrimary":false,"firstSeenAt":"2026-05-09T01:05:29.470Z","lastSeenAt":"2026-05-18T19:06:30.822Z"}],"details":{"listingId":"2ba9c2d6-d2e2-4d53-a08d-2440049c493f","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"harness","slug":"manage-slos","github":{"repo":"harness/harness-skills","stars":15,"topics":["agent-skills","agents"],"license":"apache-2.0","html_url":"https://github.com/harness/harness-skills","pushed_at":"2026-05-13T01:28:28Z","description":"A collection of structured AI agent skills that   enable Claude Code, Cursor, GitHub Copilot, and   other AI coding assistants to create, operate,   debug, and govern Harness CI/CD workflows through   natural language.","skill_md_sha":"f46e83caf5104195a832713c8cc9f97e08004c71","skill_md_path":"skills/manage-slos/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/harness/harness-skills/tree/main/skills/manage-slos"},"layout":"multi","source":"github","category":"harness-skills","frontmatter":{"name":"manage-slos","license":"Apache-2.0","description":">-","compatibility":"Requires Harness MCP v2 server (harness-mcp-v2).\nNOTE: SRM CRUD resources (`slo`, `slo_alert`, `monitored_service`) are NOT currently exposed\nby the MCP server. Creating/managing SLOs and monitored services must be done in the Harness\nUI. This skill focuses on the incident-correlation and reporting workflows that the MCP server\ndoes support today (via `execution`, `service`, `environment`)."},"skills_sh_url":"https://skills.sh/harness/harness-skills/manage-slos"},"updatedAt":"2026-05-18T19:06:30.822Z"}}