{"id":"ccb8a0f8-4ceb-4ce9-9300-f2382fb514db","shortId":"zFTenU","kind":"skill","title":"server-management","tagline":"Server management principles and decision-making. Process management, monitoring strategy, and scaling decisions. Teaches thinking, not commands.","description":"# Server Management\n\n> Server management principles for production operations.\n> **Learn to THINK, not memorize commands.**\n\n---\n\n## 1. Process Management Principles\n\n### Tool Selection\n\n| Scenario | Tool |\n|----------|------|\n| **Node.js app** | PM2 (clustering, reload) |\n| **Any app** | systemd (Linux native) |\n| **Containers** | Docker/Podman |\n| **Orchestration** | Kubernetes, Docker Swarm |\n\n### Process Management Goals\n\n| Goal | What It Means |\n|------|---------------|\n| **Restart on crash** | Auto-recovery |\n| **Zero-downtime reload** | No service interruption |\n| **Clustering** | Use all CPU cores |\n| **Persistence** | Survive server reboot |\n\n---\n\n## 2. Monitoring Principles\n\n### What to Monitor\n\n| Category | Key Metrics |\n|----------|-------------|\n| **Availability** | Uptime, health checks |\n| **Performance** | Response time, throughput |\n| **Errors** | Error rate, types |\n| **Resources** | CPU, memory, disk |\n\n### Alert Severity Strategy\n\n| Level | Response |\n|-------|----------|\n| **Critical** | Immediate action |\n| **Warning** | Investigate soon |\n| **Info** | Review daily |\n\n### Monitoring Tool Selection\n\n| Need | Options |\n|------|---------|\n| Simple/Free | PM2 metrics, htop |\n| Full observability | Grafana, Datadog |\n| Error tracking | Sentry |\n| Uptime | UptimeRobot, Pingdom |\n\n---\n\n## 3. Log Management Principles\n\n### Log Strategy\n\n| Log Type | Purpose |\n|----------|---------|\n| **Application logs** | Debug, audit |\n| **Access logs** | Traffic analysis |\n| **Error logs** | Issue detection |\n\n### Log Principles\n\n1. **Rotate logs** to prevent disk fill\n2. **Structured logging** (JSON) for parsing\n3. **Appropriate levels** (error/warn/info/debug)\n4. **No sensitive data** in logs\n\n---\n\n## 4. Scaling Decisions\n\n### When to Scale\n\n| Symptom | Solution |\n|---------|----------|\n| High CPU | Add instances (horizontal) |\n| High memory | Increase RAM or fix leak |\n| Slow response | Profile first, then scale |\n| Traffic spikes | Auto-scaling |\n\n### Scaling Strategy\n\n| Type | When to Use |\n|------|-------------|\n| **Vertical** | Quick fix, single instance |\n| **Horizontal** | Sustainable, distributed |\n| **Auto** | Variable traffic |\n\n---\n\n## 5. Health Check Principles\n\n### What Constitutes Healthy\n\n| Check | Meaning |\n|-------|---------|\n| **HTTP 200** | Service responding |\n| **Database connected** | Data accessible |\n| **Dependencies OK** | External services reachable |\n| **Resources OK** | CPU/memory not exhausted |\n\n### Health Check Implementation\n\n- Simple: Just return 200\n- Deep: Check all dependencies\n- Choose based on load balancer needs\n\n---\n\n## 6. Security Principles\n\n| Area | Principle |\n|------|-----------|\n| **Access** | SSH keys only, no passwords |\n| **Firewall** | Only needed ports open |\n| **Updates** | Regular security patches |\n| **Secrets** | Environment vars, not files |\n| **Audit** | Log access and changes |\n\n---\n\n## 7. Troubleshooting Priority\n\nWhen something's wrong:\n\n1. **Check if running** (process status)\n2. **Check logs** (error messages)\n3. **Check resources** (disk, memory, CPU)\n4. **Check network** (ports, DNS)\n5. **Check dependencies** (database, APIs)\n\n---\n\n## 8. Anti-Patterns\n\n| ❌ Don't | ✅ Do |\n|----------|-------|\n| Run as root | Use non-root user |\n| Ignore logs | Set up log rotation |\n| Skip monitoring | Monitor from day one |\n| Manual restarts | Auto-restart config |\n| No backups | Regular backup schedule |\n\n---\n\n> **Remember:** A well-managed server is boring. That's the goal.\n\n## When to Use\nThis skill is applicable to execute the workflow or actions described in the overview.\n\n## Limitations\n- Use this skill only when the task clearly matches the scope described above.\n- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.\n- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.","tags":["server","management","antigravity","awesome","skills","sickn33","agent-skills","agentic-skills","ai-agent-skills","ai-agents","ai-coding","ai-workflows"],"capabilities":["skill","source-sickn33","skill-server-management","topic-agent-skills","topic-agentic-skills","topic-ai-agent-skills","topic-ai-agents","topic-ai-coding","topic-ai-workflows","topic-antigravity","topic-antigravity-skills","topic-claude-code","topic-claude-code-skills","topic-codex-cli","topic-codex-skills"],"categories":["antigravity-awesome-skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/sickn33/antigravity-awesome-skills/server-management","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add sickn33/antigravity-awesome-skills","source_repo":"https://github.com/sickn33/antigravity-awesome-skills","install_from":"skills.sh"}},"qualityScore":"0.700","qualityRationale":"deterministic score 0.70 from registry signals: · indexed on github topic:agent-skills · 34515 github stars · SKILL.md body (3,908 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-04-22T12:51:45.222Z","embedding":null,"createdAt":"2026-04-18T21:44:43.783Z","updatedAt":"2026-04-22T12:51:45.222Z","lastSeenAt":"2026-04-22T12:51:45.222Z","tsv":"'1':36,170,322 '2':89,177,328 '200':251,274 '3':147,183,333 '4':187,193,339 '5':241,344 '6':285 '7':315 '8':349 'access':160,257,290,312 'action':121,411 'add':203 'alert':114 'analysi':163 'anti':351 'anti-pattern':350 'api':348 'app':45,50 'applic':156,405 'appropri':184 'area':288 'ask':449 'audit':159,310 'auto':71,222,238,379 'auto-recoveri':70 'auto-restart':378 'auto-sc':221 'avail':98 'backup':383,385 'balanc':283 'base':280 'bore':394 'boundari':457 'categori':95 'chang':314 'check':101,243,248,269,276,323,329,334,340,345 'choos':279 'clarif':451 'clear':424 'cluster':47,80 'command':21,35 'config':381 'connect':255 'constitut':246 'contain':54 'core':84 'cpu':83,111,202,338 'cpu/memory':265 'crash':69 'criteria':460 'critic':119 'daili':127 'data':190,256 'databas':254,347 'datadog':140 'day':374 'debug':158 'decis':9,17,195 'decision-mak':8 'deep':275 'depend':258,278,346 'describ':412,428 'detect':167 'disk':113,175,336 'distribut':237 'dns':343 'docker':58 'docker/podman':55 'downtim':75 'environ':306,440 'environment-specif':439 'error':106,107,141,164,331 'error/warn/info/debug':186 'execut':407 'exhaust':267 'expert':445 'extern':260 'file':309 'fill':176 'firewal':296 'first':216 'fix':211,232 'full':137 'goal':62,63,398 'grafana':139 'health':100,242,268 'healthi':247 'high':201,206 'horizont':205,235 'htop':136 'http':250 'ignor':364 'immedi':120 'implement':270 'increas':208 'info':125 'input':454 'instanc':204,234 'interrupt':79 'investig':123 'issu':166 'json':180 'key':96,292 'kubernet':57 'leak':212 'learn':30 'level':117,185 'limit':416 'linux':52 'load':282 'log':148,151,153,157,161,165,168,172,179,192,311,330,365,368 'make':10 'manag':3,5,12,23,25,38,61,149,391 'manual':376 'match':425 'mean':66,249 'memor':34 'memori':112,207,337 'messag':332 'metric':97,135 'miss':462 'monitor':13,90,94,128,371,372 'nativ':53 'need':131,284,298 'network':341 'node.js':44 'non':361 'non-root':360 'observ':138 'ok':259,264 'one':375 'open':300 'oper':29 'option':132 'orchestr':56 'output':434 'overview':415 'pars':182 'password':295 'patch':304 'pattern':352 'perform':102 'permiss':455 'persist':85 'pingdom':146 'pm2':46,134 'port':299,342 'prevent':174 'principl':6,26,39,91,150,169,244,287,289 'prioriti':317 'process':11,37,60,326 'product':28 'profil':215 'purpos':155 'quick':231 'ram':209 'rate':108 'reachabl':262 'reboot':88 'recoveri':72 'regular':302,384 'reload':48,76 'rememb':387 'requir':453 'resourc':110,263,335 'respond':253 'respons':103,118,214 'restart':67,377,380 'return':273 'review':126,446 'root':358,362 'rotat':171,369 'run':325,356 'safeti':456 'scale':16,194,198,218,223,224 'scenario':42 'schedul':386 'scope':427 'secret':305 'secur':286,303 'select':41,130 'sensit':189 'sentri':143 'server':2,4,22,24,87,392 'server-manag':1 'servic':78,252,261 'set':366 'sever':115 'simpl':271 'simple/free':133 'singl':233 'skill':403,419 'skill-server-management' 'skip':370 'slow':213 'solut':200 'someth':319 'soon':124 'source-sickn33' 'specif':441 'spike':220 'ssh':291 'status':327 'stop':447 'strategi':14,116,152,225 'structur':178 'substitut':437 'success':459 'surviv':86 'sustain':236 'swarm':59 'symptom':199 'systemd':51 'task':423 'teach':18 'test':443 'think':19,32 'throughput':105 'time':104 'tool':40,43,129 'topic-agent-skills' 'topic-agentic-skills' 'topic-ai-agent-skills' 'topic-ai-agents' 'topic-ai-coding' 'topic-ai-workflows' 'topic-antigravity' 'topic-antigravity-skills' 'topic-claude-code' 'topic-claude-code-skills' 'topic-codex-cli' 'topic-codex-skills' 'track':142 'traffic':162,219,240 'treat':432 'troubleshoot':316 'type':109,154,226 'updat':301 'uptim':99,144 'uptimerobot':145 'use':81,229,359,401,417 'user':363 'valid':442 'var':307 'variabl':239 'vertic':230 'warn':122 'well':390 'well-manag':389 'workflow':409 'wrong':321 'zero':74 'zero-downtim':73","prices":[{"id":"ac4439a7-a4f5-4b28-be45-4b55308b9135","listingId":"ccb8a0f8-4ceb-4ce9-9300-f2382fb514db","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"sickn33","category":"antigravity-awesome-skills","install_from":"skills.sh"},"createdAt":"2026-04-18T21:44:43.783Z"}],"sources":[{"listingId":"ccb8a0f8-4ceb-4ce9-9300-f2382fb514db","source":"github","sourceId":"sickn33/antigravity-awesome-skills/server-management","sourceUrl":"https://github.com/sickn33/antigravity-awesome-skills/tree/main/skills/server-management","isPrimary":false,"firstSeenAt":"2026-04-18T21:44:43.783Z","lastSeenAt":"2026-04-22T12:51:45.222Z"}],"details":{"listingId":"ccb8a0f8-4ceb-4ce9-9300-f2382fb514db","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"sickn33","slug":"server-management","github":{"repo":"sickn33/antigravity-awesome-skills","stars":34515,"topics":["agent-skills","agentic-skills","ai-agent-skills","ai-agents","ai-coding","ai-workflows","antigravity","antigravity-skills","claude-code","claude-code-skills","codex-cli","codex-skills","cursor","cursor-skills","developer-tools","gemini-cli","gemini-skills","kiro","mcp","skill-library"],"license":"mit","html_url":"https://github.com/sickn33/antigravity-awesome-skills","pushed_at":"2026-04-22T06:40:00Z","description":"Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.","skill_md_sha":"a0562f11a8e01918667b4e386542f05141b73930","skill_md_path":"skills/server-management/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/sickn33/antigravity-awesome-skills/tree/main/skills/server-management"},"layout":"multi","source":"github","category":"antigravity-awesome-skills","frontmatter":{"name":"server-management","description":"Server management principles and decision-making. Process management, monitoring strategy, and scaling decisions. Teaches thinking, not commands."},"skills_sh_url":"https://skills.sh/sickn33/antigravity-awesome-skills/server-management"},"updatedAt":"2026-04-22T12:51:45.222Z"}}