{"id":"bae8df4a-edb9-4a9d-b977-da3e04051486","shortId":"KEjD2X","kind":"skill","title":"autonomous-agents","tagline":"Autonomous agents are AI systems that can independently decompose","description":"# Autonomous Agents\n\nAutonomous agents are AI systems that can independently decompose goals,\nplan actions, execute tools, and self-correct without constant human guidance.\nThe challenge isn't making them capable - it's making them reliable. Every\nextra decision multiplies failure probability.\n\nThis skill covers agent loops (ReAct, Plan-Execute), goal decomposition,\nreflection patterns, and production reliability. Key insight: compounding\nerror rates kill autonomous agents. A 95% success rate per step drops to\n60% by step 10. Build for reliability first, autonomy second.\n\n2025 lesson: The winners are constrained, domain-specific agents with clear\nboundaries, not \"autonomous everything.\" Treat AI outputs as proposals,\nnot truth.\n\n## Principles\n\n- Reliability over autonomy - every step compounds error probability\n- Constrain scope - domain-specific beats general-purpose\n- Treat outputs as proposals, not truth\n- Build guardrails before expanding capabilities\n- Human-in-the-loop for critical decisions is non-negotiable\n- Log everything - every action must be auditable\n- Fail safely with rollback, not silently with corruption\n\n## Capabilities\n\n- autonomous-agents\n- agent-loops\n- goal-decomposition\n- self-correction\n- reflection-patterns\n- react-pattern\n- plan-execute\n- agent-reliability\n- agent-guardrails\n\n## Scope\n\n- multi-agent-systems → multi-agent-orchestration\n- tool-building → agent-tool-builder\n- memory-systems → agent-memory-systems\n- workflow-orchestration → workflow-automation\n\n## Tooling\n\n### Frameworks\n\n- LangGraph - When: Production agents with state management Note: 1.0 released Oct 2025, checkpointing, human-in-loop\n- AutoGPT - When: Research/experimentation, open-ended exploration Note: Needs external guardrails for production\n- CrewAI - When: Role-based agent teams Note: Good for specialized agent collaboration\n- Claude Agent SDK - When: Anthropic ecosystem agents Note: Computer use, tool execution\n\n### Patterns\n\n- ReAct - When: Reasoning + Acting in alternating steps Note: Foundation for most modern agents\n- Plan-Execute - When: Separate planning from execution Note: Better for complex multi-step tasks\n- Reflection - When: Self-evaluation and correction Note: Evaluator-optimizer loop\n\n## Patterns\n\n### ReAct Agent Loop\n\nAlternating reasoning and action steps\n\n**When to use**: Interactive problem-solving, tool use, exploration\n\n# REACT PATTERN:\n\n\"\"\"\nThe ReAct loop:\n1. Thought: Reason about what to do next\n2. Action: Choose and execute a tool\n3. Observation: Receive result\n4. Repeat until goal achieved\n\nKey: Explicit reasoning traces make debugging possible\n\"\"\"\n\n## Basic ReAct Implementation\n\"\"\"\nfrom langchain.agents import create_react_agent\nfrom langchain_openai import ChatOpenAI\n\n# Define the ReAct prompt template\nreact_prompt = '''\nAnswer the question using the following format:\n\nQuestion: the input question\nThought: reason about what to do\nAction: tool_name\nAction Input: input to the tool\nObservation: result of the action\n... (repeat Thought/Action/Observation as needed)\nThought: I now know the final answer\nFinal Answer: the answer\n'''\n\n# Create the agent\nagent = create_react_agent(\n    llm=ChatOpenAI(model=\"gpt-4o\"),\n    tools=tools,\n    prompt=react_prompt,\n)\n\n# Execute with step limit\nresult = agent.invoke(\n    {\"input\": query},\n    config={\"max_iterations\": 10}  # Prevent runaway loops\n)\n\"\"\"\n\n## LangGraph ReAct (Production)\n\"\"\"\nfrom langgraph.prebuilt import create_react_agent\nfrom langgraph.checkpoint.postgres import PostgresSaver\n\n# Production checkpointer\ncheckpointer = PostgresSaver.from_conn_string(\n    os.environ[\"POSTGRES_URL\"]\n)\n\nagent = create_react_agent(\n    model=llm,\n    tools=tools,\n    checkpointer=checkpointer,  # Durable state\n)\n\n# Invoke with thread for state persistence\nconfig = {\"configurable\": {\"thread_id\": \"user-123\"}}\nresult = agent.invoke({\"messages\": [query]}, config)\n\"\"\"\n\n### Plan-Execute Pattern\n\nSeparate planning phase from execution\n\n**When to use**: Complex multi-step tasks, when full plan visibility matters\n\n# PLAN-EXECUTE PATTERN:\n\n\"\"\"\nTwo-phase approach:\n1. Planning: Decompose goal into subtasks\n2. Execution: Execute subtasks, potentially re-plan\n\nAdvantages:\n- Full visibility into plan before execution\n- Can validate/modify plan with human\n- Cleaner separation of concerns\n\nDisadvantages:\n- Less adaptive to mid-task discoveries\n- Plan may become stale\n\"\"\"\n\n## LangGraph Plan-Execute\n\"\"\"\nfrom langgraph.prebuilt import create_plan_and_execute_agent\n\n# Planner creates the task list\nplanner_prompt = '''\nFor the given objective, create a step-by-step plan.\nEach step should be atomic and actionable.\nFormat: numbered list of steps.\n'''\n\n# Executor handles individual steps\nexecutor_prompt = '''\nYou are executing step {step_number} of the plan.\nPrevious results: {previous_results}\nCurrent step: {current_step}\nExecute this step using available tools.\n'''\n\nagent = create_plan_and_execute_agent(\n    planner=planner_llm,\n    executor=executor_llm,\n    tools=tools,\n    replan_on_error=True,  # Re-plan if step fails\n)\n\n# Human approval of plan\nconfig = {\n    \"configurable\": {\n        \"thread_id\": \"task-456\",\n    },\n    \"interrupt_before\": [\"execute\"],  # Pause before execution\n}\n\n# First call creates plan\nplan = agent.invoke({\"objective\": goal}, config)\n\n# Review plan, then continue\nif human_approves(plan):\n    result = agent.invoke(None, config)  # Continue from checkpoint\n\"\"\"\n\n## Decomposition Strategies\n\"\"\"\n# Decomposition-First: Plan everything, then execute\n# Best for: Stable tasks, need full plan approval\n\n# Interleaved: Plan one step, execute, repeat\n# Best for: Dynamic tasks, learning as you go\n\ndef interleaved_execute(goal, max_steps=10):\n    state = {\"goal\": goal, \"completed\": [], \"remaining\": [goal]}\n\n    for step in range(max_steps):\n        # Plan next action based on current state\n        next_action = planner.plan_next(state)\n\n        if next_action == \"DONE\":\n            break\n\n        # Execute and update state\n        result = executor.execute(next_action)\n        state[\"completed\"].append((next_action, result))\n\n        # Re-evaluate remaining work\n        state[\"remaining\"] = planner.reassess(state)\n\n    return state\n\"\"\"\n\n### Reflection Pattern\n\nSelf-evaluation and iterative improvement\n\n**When to use**: Quality matters, complex outputs, creative tasks\n\n# REFLECTION PATTERN:\n\n\"\"\"\nSelf-correction loop:\n1. Generate initial output\n2. Evaluate against criteria\n3. Critique and identify issues\n4. Refine based on critique\n5. Repeat until satisfactory\n\nAlso called: Evaluator-Optimizer, Self-Critique\n\"\"\"\n\n## Basic Reflection\n\"\"\"\ndef reflect_and_improve(task, max_iterations=3):\n    # Initial generation\n    output = generator.generate(task)\n\n    for i in range(max_iterations):\n        # Evaluate output\n        critique = evaluator.critique(\n            task=task,\n            output=output,\n            criteria=[\n                \"Correctness\",\n                \"Completeness\",\n                \"Clarity\",\n            ]\n        )\n\n        if critique[\"passes_all\"]:\n            return output\n\n        # Refine based on critique\n        output = generator.refine(\n            task=task,\n            previous_output=output,\n            critique=critique[\"feedback\"],\n        )\n\n    return output  # Best effort after max iterations\n\"\"\"\n\n## LangGraph Reflection\n\"\"\"\nfrom langgraph.graph import StateGraph\n\ndef build_reflection_graph():\n    graph = StateGraph(ReflectionState)\n\n    # Nodes\n    graph.add_node(\"generate\", generate_node)\n    graph.add_node(\"reflect\", reflect_node)\n    graph.add_node(\"output\", output_node)\n\n    # Edges\n    graph.add_edge(\"generate\", \"reflect\")\n    graph.add_conditional_edges(\n        \"reflect\",\n        should_continue,\n        {\n            \"continue\": \"generate\",  # Loop back\n            \"end\": \"output\",\n        }\n    )\n\n    return graph.compile()\n\ndef should_continue(state):\n    if state[\"iteration\"] >= 3:\n        return \"end\"\n    if state[\"score\"] >= 0.9:\n        return \"end\"\n    return \"continue\"\n\"\"\"\n\n## Separate Evaluator (More Robust)\n\"\"\"\n# Use different model for evaluation to avoid self-bias\ngenerator = ChatOpenAI(model=\"gpt-4o\")\nevaluator = ChatOpenAI(model=\"gpt-4o-mini\")  # Different perspective\n\n# Or use specialized evaluators\nfrom langchain.evaluation import load_evaluator\nevaluator = load_evaluator(\"criteria\", criteria=\"correctness\")\n\"\"\"\n\n### Guardrailed Autonomy\n\nConstrained agents with safety boundaries\n\n**When to use**: Production systems, critical operations\n\n# GUARDRAILED AUTONOMY:\n\n\"\"\"\nProduction agents need multiple safety layers:\n1. Input validation\n2. Action constraints\n3. Output validation\n4. Cost limits\n5. Human escalation\n6. Rollback capability\n\"\"\"\n\n## Multi-Layer Guardrails\n\"\"\"\nclass GuardedAgent:\n    def __init__(self, agent, config):\n        self.agent = agent\n        self.max_cost = config.get(\"max_cost_usd\", 1.0)\n        self.max_steps = config.get(\"max_steps\", 10)\n        self.allowed_actions = config.get(\"allowed_actions\", [])\n        self.require_approval = config.get(\"require_approval\", [])\n\n    async def execute(self, goal):\n        total_cost = 0\n        steps = 0\n\n        while steps < self.max_steps:\n            # Get next action\n            action = await self.agent.plan_next(goal)\n\n            # Validate action is allowed\n            if action.name not in self.allowed_actions:\n                raise ActionNotAllowedError(action.name)\n\n            # Check if approval needed\n            if action.name in self.require_approval:\n                approved = await self.request_human_approval(action)\n                if not approved:\n                    return {\"status\": \"rejected\", \"action\": action}\n\n            # Estimate cost\n            estimated_cost = self.estimate_cost(action)\n            if total_cost + estimated_cost > self.max_cost:\n                raise CostLimitExceededError(total_cost)\n\n            # Execute with rollback capability\n            checkpoint = await self.save_checkpoint()\n            try:\n                result = await self.agent.execute(action)\n                total_cost += self.actual_cost(action)\n                steps += 1\n            except Exception as e:\n                await self.rollback_to(checkpoint)\n                raise\n\n            if result.is_complete:\n                break\n\n        return {\"status\": \"complete\", \"total_cost\": total_cost}\n\"\"\"\n\n## Least Privilege Principle\n\"\"\"\n# Define minimal permissions per task type\nTASK_PERMISSIONS = {\n    \"research\": [\"web_search\", \"read_file\"],\n    \"coding\": [\"read_file\", \"write_file\", \"run_tests\"],\n    \"admin\": [\"all\"],  # Rarely grant this\n}\n\ndef create_scoped_agent(task_type):\n    allowed = TASK_PERMISSIONS.get(task_type, [])\n    tools = [t for t in ALL_TOOLS if t.name in allowed]\n    return Agent(tools=tools)\n\"\"\"\n\n## Cost Control\n\"\"\"\n# Context length grows quadratically in cost\n# Double context = 4x cost\n\ndef trim_context(messages, max_tokens=4000):\n    # Keep system message and recent messages\n    system = messages[0]\n    recent = messages[-10:]\n\n    # Summarize middle if needed\n    if len(messages) > 11:\n        middle = messages[1:-10]\n        summary = summarize(middle)\n        return [system, summary] + recent\n\n    return messages\n\"\"\"\n\n### Durable Execution Pattern\n\nAgents that survive failures and resume\n\n**When to use**: Long-running tasks, production systems, multi-day processes\n\n# DURABLE EXECUTION:\n\n\"\"\"\nProduction agents must:\n- Survive server restarts\n- Resume from exact point of failure\n- Handle hours/days of runtime\n- Allow human intervention mid-process\n\nLangGraph 1.0 provides this natively.\n\"\"\"\n\n## LangGraph Checkpointing\n\"\"\"\nfrom langgraph.checkpoint.postgres import PostgresSaver\nfrom langgraph.graph import StateGraph\n\n# Production checkpointer (not MemorySaver!)\ncheckpointer = PostgresSaver.from_conn_string(\n    os.environ[\"POSTGRES_URL\"]\n)\n\n# Build graph with checkpointing\ngraph = StateGraph(AgentState)\n# ... add nodes and edges ...\n\nagent = graph.compile(checkpointer=checkpointer)\n\n# Each invocation saves state\nconfig = {\"configurable\": {\"thread_id\": \"long-task-789\"}}\n\n# Start task\nagent.invoke({\"goal\": complex_goal}, config)\n\n# If server dies, resume later:\nstate = agent.get_state(config)\nif not state.is_complete:\n    agent.invoke(None, config)  # Continues from checkpoint\n\"\"\"\n\n## Human-in-the-Loop Interrupts\n\"\"\"\n# Pause at specific nodes\nagent = graph.compile(\n    checkpointer=checkpointer,\n    interrupt_before=[\"critical_action\"],  # Pause before\n    interrupt_after=[\"validation\"],        # Pause after\n)\n\n# First invocation pauses at interrupt\nresult = agent.invoke({\"goal\": goal}, config)\n\n# Human reviews state\nstate = agent.get_state(config)\nif human_approves(state):\n    # Continue from pause point\n    agent.invoke(None, config)\nelse:\n    # Modify state and continue\n    agent.update_state(config, {\"approved\": False})\n    agent.invoke(None, config)\n\"\"\"\n\n## Time-Travel Debugging\n\"\"\"\n# LangGraph stores full history\nhistory = list(agent.get_state_history(config))\n\n# Go back to any previous state\npast_state = history[5]\nagent.update_state(config, past_state.values)\n\n# Replay from that point with modifications\nagent.invoke(None, config)\n\"\"\"\n\n## Sharp Edges\n\n### Error Probability Compounds Exponentially\n\nSeverity: CRITICAL\n\nSituation: Building multi-step autonomous agents\n\nSymptoms:\nAgent works in demos but fails in production. Simple tasks succeed,\ncomplex tasks fail mysteriously. Success rate drops dramatically\nas task complexity increases. Users lose trust.\n\nWhy this breaks:\nEach step has independent failure probability. A 95% success rate\nper step sounds great until you realize:\n- 5 steps: 77% success (0.95^5)\n- 10 steps: 60% success (0.95^10)\n- 20 steps: 36% success (0.95^20)\n\nThis is the fundamental limit of autonomous agents. Every additional\nstep multiplies failure probability.\n\nRecommended fix:\n\n## Reduce step count\n# Combine steps where possible\n# Prefer fewer, more capable steps over many small ones\n\n## Increase per-step reliability\n# Use structured outputs (JSON schemas)\n# Add validation at each step\n# Use better models for critical steps\n\n## Design for failure\nclass RobustAgent:\n    def execute_with_retry(self, step, max_retries=3):\n        for attempt in range(max_retries):\n            try:\n                result = step.execute()\n                if self.validate(result):\n                    return result\n            except Exception as e:\n                if attempt == max_retries - 1:\n                    raise\n                self.log_retry(step, attempt, e)\n\n## Break into checkpointed segments\n# Human review at each segment\n# Resume from last good checkpoint\n\n### API Costs Explode with Context Growth\n\nSeverity: CRITICAL\n\nSituation: Running agents with growing conversation context\n\nSymptoms:\n$47 to close a single support ticket. Thousands in surprise API bills.\nAgents getting slower as they run longer. Token counts exceeding\nmodel limits.\n\nWhy this breaks:\nTransformer costs scale quadratically with context length. Double\nthe context, quadruple the compute. A long-running agent that\nre-sends its full conversation each turn can burn money exponentially.\n\nMost agents append to context without trimming. Context grows:\n- Turn 1: 500 tokens → $0.01\n- Turn 10: 5000 tokens → $0.10\n- Turn 50: 25000 tokens → $0.50\n- Turn 100: 50000 tokens → $1.00+ per message\n\nRecommended fix:\n\n## Set hard cost limits\nclass CostLimitedAgent:\n    MAX_COST_PER_TASK = 1.00  # USD\n\n    def __init__(self):\n        self.total_cost = 0\n\n    def before_call(self, estimated_tokens):\n        estimated_cost = self.estimate_cost(estimated_tokens)\n        if self.total_cost + estimated_cost > self.MAX_COST_PER_TASK:\n            raise CostLimitExceeded(\n                f\"Would exceed ${self.MAX_COST_PER_TASK} limit\"\n            )\n\n    def after_call(self, response):\n        self.total_cost += self.calculate_actual_cost(response)\n\n## Trim context aggressively\ndef trim_context(messages, max_tokens=4000):\n    # Keep: system prompt + last N messages\n    # Summarize: everything in between\n    if count_tokens(messages) <= max_tokens:\n        return messages\n\n    system = messages[0]\n    recent = messages[-5:]\n    middle = messages[1:-5]\n\n    if middle:\n        summary = summarize(middle)  # Compress history\n        return [system, summary] + recent\n\n    return [system] + recent\n\n## Use streaming to track costs in real-time\n## Alert at 50% of budget, halt at 90%\n\n### Demo Works But Production Fails\n\nSeverity: CRITICAL\n\nSituation: Moving from prototype to production\n\nSymptoms:\nImpressive demo to stakeholders. Months of failure in production.\nWorks for the founder's use case, fails for real users. Edge cases\noverwhelm the system.\n\nWhy this breaks:\nDemos show the happy path with curated inputs. Production means:\n- Unexpected inputs (typos, ambiguity, adversarial)\n- Scale (1000 users, not 3)\n- Reliability (99.9% uptime, not \"usually works\")\n- Edge cases (the 1% that breaks everything)\n\nThe methodology is questionable, but the core problem is real.\nThe gap between a working demo and a reliable production system\nis where projects die.\n\nRecommended fix:\n\n## Test at scale before production\n# Run 1000+ test cases, not 10\n# Measure P95/P99 success rate, not average\n# Include adversarial inputs\n\n## Build observability first\nimport structlog\nlogger = structlog.get_logger()\n\nclass ObservableAgent:\n    def execute(self, task):\n        with logger.bind(task_id=task.id):\n            logger.info(\"task_started\")\n            try:\n                result = self._execute(task)\n                logger.info(\"task_completed\", result=result)\n                return result\n            except Exception as e:\n                logger.error(\"task_failed\", error=str(e))\n                raise\n\n## Have escape hatches\n# Human takeover when confidence < threshold\n# Graceful degradation to simpler behavior\n# \"I don't know\" is a valid response\n\n## Deploy incrementally\n# 1% of traffic, then 10%, then 50%\n# Monitor error rates at each stage\n\n### Agent Fabricates Data When Stuck\n\nSeverity: HIGH\n\nSituation: Agent can't complete task with available information\n\nSymptoms:\nAgent invents plausible-looking data. Fake restaurant names on expense\nreports. Made-up statistics in reports. Confident answers that are\ncompletely wrong.\n\nWhy this breaks:\nLLMs are trained to be helpful and produce plausible outputs. When\nstuck, they don't say \"I can't do this\" - they fabricate. Autonomous\nagents compound this by acting on fabricated data without human review.\n\nThe agent that fabricated expense entries was trying to meet its goal\n(complete the expense report). It \"solved\" the problem by inventing data.\n\nRecommended fix:\n\n## Validate against ground truth\ndef validate_expense(expense):\n    # Cross-check with external sources\n    if expense.restaurant:\n        if not verify_restaurant_exists(expense.restaurant):\n            raise ValidationError(\"Restaurant not found\")\n\n    # Check for suspicious patterns\n    if expense.amount == round(expense.amount, -1):\n        flag_for_review(\"Suspiciously round amount\")\n\n## Require evidence\nsystem_prompt = '''\nFor every factual claim, cite the specific tool output that\nsupports it. If you cannot find supporting evidence, say\n\"I could not verify this\" rather than guessing.\n'''\n\n## Use structured outputs\nfrom pydantic import BaseModel\n\nclass VerifiedClaim(BaseModel):\n    claim: str\n    source: str  # Must reference tool output\n    confidence: float\n\n## Detect uncertainty\n# Train to output confidence scores\n# Flag low-confidence outputs for human review\n# Never auto-execute on uncertain data\n\n### Integration Is Where Agents Die\n\nSeverity: HIGH\n\nSituation: Connecting agent to external systems\n\nSymptoms:\nWorks with mock APIs, fails with real ones. Rate limits cause crashes.\nAuth tokens expire mid-task. Data format mismatches. Partial failures\nleave systems in inconsistent state.\n\nWhy this breaks:\nThe companies promising \"autonomous agents that integrate with your\nentire tech stack\" haven't built production systems at scale.\nReal integrations have:\n- Rate limits (429 errors mid-task)\n- Auth complexity (OAuth refresh, token expiry)\n- Data format variations (API v1 vs v2)\n- Partial failures (webhook received, processing failed)\n- Eventual consistency (data not immediately available)\n\nRecommended fix:\n\n## Build robust API clients\nfrom tenacity import retry, stop_after_attempt, wait_exponential\n\nclass RobustAPIClient:\n    @retry(\n        stop=stop_after_attempt(3),\n        wait=wait_exponential(multiplier=1, min=4, max=60)\n    )\n    async def call(self, endpoint, data):\n        response = await self.client.post(endpoint, json=data)\n        if response.status_code == 429:\n            retry_after = response.headers.get(\"Retry-After\", 60)\n            await asyncio.sleep(int(retry_after))\n            raise RateLimitError()\n        return response\n\n## Handle auth lifecycle\nclass TokenManager:\n    def __init__(self):\n        self.token = None\n        self.expires_at = None\n\n    async def get_token(self):\n        if self.is_expired():\n            self.token = await self.refresh_token()\n        return self.token\n\n    def is_expired(self):\n        buffer = timedelta(minutes=5)  # Refresh early\n        return datetime.now() > (self.expires_at - buffer)\n\n## Use idempotency keys\n# Every external action should be idempotent\n# If agent retries, external system handles duplicate\n\n## Design for partial failure\n# Each step is independently recoverable\n# Checkpoint before external calls\n# Rollback capability for each integration\n\n### Agent Takes Dangerous Actions\n\nSeverity: HIGH\n\nSituation: Agent with broad permissions\n\nSymptoms:\nAgent deletes production data. Sends emails to wrong recipients.\nMakes purchases without approval. Modifies settings it shouldn't.\nActions that can't be undone.\n\nWhy this breaks:\nAgents optimize for their goal. Without guardrails, they'll take the\nshortest path - even if that path is destructive. An agent told to\n\"clean up the database\" might interpret that as \"delete everything.\"\n\nBroad permissions + autonomy + goal optimization = danger.\n\nRecommended fix:\n\n### Least privilege principle\nPERMISSIONS = {\n    \"research_agent\": [\"read_web\", \"read_docs\"],\n    \"code_agent\": [\"read_file\", \"write_file\", \"run_tests\"],\n    \"email_agent\": [\"read_email\", \"draft_email\"],  # NOT send\n    \"admin_agent\": [\"all\"],  # Rarely used\n}\n\n## Separate read/write permissions\n# Agent can read anything\n# Write requires explicit approval\n\n## Dangerous actions require confirmation\nDANGEROUS_ACTIONS = [\n    \"delete_*\",\n    \"send_email\",\n    \"transfer_money\",\n    \"modify_production\",\n    \"revoke_access\",\n]\n\nasync def execute_action(action):\n    if matches_dangerous_pattern(action):\n        approval = await request_human_approval(action)\n        if not approval:\n            return ActionRejected(action)\n    return await actually_execute(action)\n\n## Dry-run mode for testing\n# Agent describes what it would do\n# Human approves the plan\n# Then agent executes\n\n## Audit logging for everything\n# Every action logged with context\n# Who authorized it\n# What changed\n# How to reverse it\n\n### Agent Runs Out of Context Window\n\nSeverity: MEDIUM\n\nSituation: Long-running agent tasks\n\nSymptoms:\nAgent forgets earlier instructions. Contradicts itself. Loses track\nof the goal. Starts repeating itself. Model errors about token limits.\n\nWhy this breaks:\nEvery message, observation, and thought consumes context. Long tasks\nexhaust the window. When context is truncated:\n- System prompt gets dropped\n- Early important context lost\n- Agent loses coherence\n\nRecommended fix:\n\n## Track context usage\nclass ContextManager:\n    def __init__(self, max_tokens=100000):\n        self.max_tokens = max_tokens\n        self.messages = []\n\n    def add(self, message):\n        self.messages.append(message)\n        self.maybe_compact()\n\n    def maybe_compact(self):\n        if self.token_count() > self.max_tokens * 0.8:\n            self.compact()\n\n    def compact(self):\n        # Always keep: system prompt\n        system = self.messages[0]\n\n        # Always keep: last N messages\n        recent = self.messages[-10:]\n\n        # Summarize: everything else\n        middle = self.messages[1:-10]\n        if middle:\n            summary = summarize_messages(middle)\n            self.messages = [system, summary] + recent\n\n## Use external memory\n# Don't keep everything in context\n# Store in vector DB, retrieve when needed\n# See agent-memory-systems skill\n\n## Hierarchical summarization\n# Recent: full detail\n# Medium: key points\n# Old: compressed summary\n\n### Can't Debug What You Can't See\n\nSeverity: MEDIUM\n\nSituation: Agent fails mysteriously\n\nSymptoms:\n\"It just didn't work.\" No idea why agent failed. Can't reproduce\nissues. Users report problems you can't explain. Debugging is\nguesswork.\n\nWhy this breaks:\nAgents make dozens of internal decisions. Without visibility into\neach step, you're blind to failure modes. Production debugging\nwithout traces is impossible.\n\nRecommended fix:\n\n## Structured logging\nimport structlog\n\nlogger = structlog.get_logger()\n\nclass TracedAgent:\n    def think(self, context):\n        with logger.bind(step=\"think\"):\n            thought = self.llm.generate(context)\n            logger.info(\"thought_generated\",\n                thought=thought,\n                tokens=count_tokens(thought)\n            )\n            return thought\n\n    def act(self, action):\n        with logger.bind(step=\"act\", action=action.name):\n            logger.info(\"action_started\")\n            try:\n                result = action.execute()\n                logger.info(\"action_completed\", result=result)\n                return result\n            except Exception as e:\n                logger.error(\"action_failed\", error=str(e))\n                raise\n\n## Use LangSmith or similar\nfrom langsmith import trace\n\n@trace\ndef agent_step(state):\n    # Automatically traced with inputs/outputs\n    return next_state\n\n## Save full traces\n# Every step, every decision\n# Inputs and outputs\n# Latency at each step\n# Token usage\n\n## Validation Checks\n\n### Agent Loop Without Step Limit\n\nSeverity: ERROR\n\nAutonomous agents must have maximum step limits\n\nMessage: Agent loop without step limit. Add max_steps to prevent infinite loops.\n\n### No Cost Tracking or Limits\n\nSeverity: ERROR\n\nAgents should track and limit API costs\n\nMessage: Agent uses LLM without cost tracking. Add cost limits to prevent runaway spending.\n\n### Agent Without Timeout\n\nSeverity: WARNING\n\nLong-running agents need timeouts\n\nMessage: Agent invocation without timeout. Add timeout to prevent hung tasks.\n\n### MemorySaver Used in Production\n\nSeverity: ERROR\n\nMemorySaver is for development only\n\nMessage: MemorySaver is not persistent. Use PostgresSaver or SqliteSaver for production.\n\n### Long-Running Agent Without Checkpointing\n\nSeverity: WARNING\n\nAgents that run multiple steps need checkpointing\n\nMessage: Multi-step agent without checkpointing. Add checkpointer for durability.\n\n### Agent Without Thread ID\n\nSeverity: WARNING\n\nCheckpointed agents need unique thread IDs\n\nMessage: Agent invocation without thread_id. State won't persist correctly.\n\n### Using Agent Output Without Validation\n\nSeverity: WARNING\n\nAgent outputs should be validated before use\n\nMessage: Agent output used without validation. Validate before acting on results.\n\n### Agent Without Structured Output\n\nSeverity: INFO\n\nStructured outputs are more reliable\n\nMessage: Consider using structured outputs (Pydantic) for more reliable parsing.\n\n### Agent Without Error Recovery\n\nSeverity: WARNING\n\nAgents should handle and recover from errors\n\nMessage: Agent call without error handling. Add try/catch or error handler.\n\n### Destructive Actions Without Rollback\n\nSeverity: WARNING\n\nActions that modify state should be reversible\n\nMessage: Destructive action without rollback capability. Save state before modification.\n\n## Collaboration\n\n### Delegation Triggers\n\n- user needs multi-agent coordination -> multi-agent-orchestration (Multiple agents working together)\n- user needs to test/evaluate agent -> agent-evaluation (Benchmarking and testing)\n- user needs tools for agent -> agent-tool-builder (Tool design and implementation)\n- user needs persistent memory -> agent-memory-systems (Long-term memory architecture)\n- user needs workflow automation -> workflow-automation (When agent is overkill for the task)\n- user needs computer control -> computer-use-agents (GUI automation, screen interaction)\n\n## Related Skills\n\nWorks well with: `agent-tool-builder`, `agent-memory-systems`, `multi-agent-orchestration`, `agent-evaluation`\n\n## When to Use\n- User mentions or implies: autonomous agent\n- User mentions or implies: autogpt\n- User mentions or implies: babyagi\n- User mentions or implies: self-prompting\n- User mentions or implies: goal decomposition\n- User mentions or implies: react pattern\n- User mentions or implies: agent loop\n- User mentions or implies: self-correcting agent\n- User mentions or implies: reflection agent\n- User mentions or implies: langgraph\n- User mentions or implies: agentic ai\n- User mentions or implies: agent planning\n\n## Limitations\n- Use this skill only when the task clearly matches the scope described above.\n- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.\n- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.","tags":["autonomous","agents","antigravity","awesome","skills","sickn33","agent-skills","agentic-skills","ai-agent-skills","ai-agents","ai-coding","ai-workflows"],"capabilities":["skill","source-sickn33","skill-autonomous-agents","topic-agent-skills","topic-agentic-skills","topic-ai-agent-skills","topic-ai-agents","topic-ai-coding","topic-ai-workflows","topic-antigravity","topic-antigravity-skills","topic-claude-code","topic-claude-code-skills","topic-codex-cli","topic-codex-skills"],"categories":["antigravity-awesome-skills"],"synonyms":[],"warnings":[],"endpointUrl":"https://skills.sh/sickn33/antigravity-awesome-skills/autonomous-agents","protocol":"skill","transport":"skills-sh","auth":{"type":"none","details":{"cli":"npx skills add sickn33/antigravity-awesome-skills","source_repo":"https://github.com/sickn33/antigravity-awesome-skills","install_from":"skills.sh"}},"qualityScore":"0.700","qualityRationale":"deterministic score 0.70 from registry signals: · indexed on github topic:agent-skills · 37911 github stars · SKILL.md body (29,421 chars)","verified":false,"liveness":"unknown","lastLivenessCheck":null,"agentReviews":{"count":0,"score_avg":null,"cost_usd_avg":null,"success_rate":null,"latency_p50_ms":null,"narrative_summary":null,"summary_updated_at":null},"enrichmentModel":"deterministic:skill-github:v1","enrichmentVersion":1,"enrichedAt":"2026-05-18T18:50:31.723Z","embedding":null,"createdAt":"2026-04-18T20:37:20.303Z","updatedAt":"2026-05-18T18:50:31.723Z","lastSeenAt":"2026-05-18T18:50:31.723Z","tsv":"'-1':2352 '-10':1335,1347,2990,2997 '-123':532 '-456':714 '-5':1975,1979 '0':1143,1145,1332,1899,1972,2982 '0.01':1862 '0.10':1867 '0.50':1872 '0.8':2971 '0.9':1011 '0.95':1651,1657,1663 '1':356,568,860,1082,1231,1346,1754,1859,1978,2082,2200,2558,2996 '1.0':243,1119,1404 '1.00':1877,1892 '10':90,483,782,1125,1653,1658,1864,2123,2204 '100':1874 '1000':2069,2119 '100000':2948 '11':1343 '2':364,574,864,1085 '20':1659,1664 '2025':97,246 '25000':1870 '3':371,868,899,1005,1088,1731,2072,2553 '36':1661 '4':375,873,1091,2560 '4000':1323,1951 '429':2501,2578 '47':1791 '4o':466,1035,1041 '4x':1315 '5':878,1094,1571,1647,1652,2629 '50':1869,2005,2206 '500':1860 '5000':1865 '50000':1875 '6':1097 '60':87,1655,2562,2585 '77':1649 '789':1455 '90':2010 '95':80,1637 '99.9':2074 'access':2807 'achiev':379 'act':294,2285,3140,3146,3381 'action':26,164,339,365,425,428,438,646,797,803,809,819,824,1086,1127,1130,1152,1153,1159,1167,1185,1192,1193,1200,1224,1229,1499,2642,2674,2701,2794,2798,2811,2812,2817,2823,2829,2834,2859,3142,3147,3150,3156,3167,3430,3435,3444 'action.execute':3154 'action.name':1163,1170,1176,3148 'actionnotallowederror':1169 'actionreject':2828 'actual':1939,2832 'adapt':600 'add':1436,1707,2955,3231,3259,3282,3332,3424 'addit':1674 'admin':1275,2777 'advantag':582 'adversari':2067,2131 'agent':3,5,14,16,58,78,106,179,181,199,202,207,211,217,224,238,270,276,279,284,303,334,395,456,457,460,495,509,512,621,681,686,1063,1077,1109,1112,1283,1302,1360,1382,1440,1492,1599,1601,1672,1785,1803,1835,1850,2213,2221,2230,2281,2293,2435,2441,2481,2647,2671,2678,2683,2710,2730,2756,2762,2770,2778,2785,2841,2852,2872,2884,2887,2933,3026,3052,3064,3083,3183,3211,3219,3226,3245,3253,3266,3274,3278,3313,3318,3329,3336,3343,3349,3360,3366,3374,3384,3405,3411,3419,3459,3463,3466,3473,3475,3484,3486,3498,3514,3527,3538,3542,3547,3550,3560,3594,3603,3609,3619,3625 'agent-evalu':3474,3549 'agent-guardrail':201 'agent-loop':180 'agent-memory-system':223,3025,3497,3541 'agent-reli':198 'agent-tool-build':216,3485,3537 'agent.get':1469,1521,1558 'agent.invoke':477,534,726,739,1458,1476,1513,1532,1545,1582 'agent.update':1540,1572 'agentst':1435 'aggress':1944 'ai':7,18,114,3620 'alert':2003 'allow':1129,1161,1286,1300,1397 'also':882 'altern':296,336 'alway':2976,2983 'ambigu':2066 'amount':2358 'answer':408,449,451,453,2249 'anthrop':282 'anyth':2788 'api':1775,1801,2449,2515,2535,3250 'append':822,1851 'approach':567 'approv':706,736,761,1132,1135,1173,1179,1180,1184,1188,1526,1543,2695,2792,2818,2822,2826,2848 'architectur':3505 'ask':3660 'async':1136,2563,2608,2808 'asyncio.sleep':2587 'atom':644 'attempt':1733,1751,1759,2543,2552 'audit':167,2854 'auth':2458,2506,2596 'author':2864 'auto':2427 'auto-execut':2426 'autogpt':252,3565 'autom':232,3509,3512,3529 'automat':3186 'autonom':2,4,13,15,77,111,178,1598,1671,2280,2480,3218,3559 'autonomi':95,123,1061,1075,2745 'autonomous-ag':1,177 'avail':679,2227,2530 'averag':2129 'avoid':1026 'await':1154,1181,1217,1222,1236,2570,2586,2617,2819,2831 'babyagi':3570 'back':993,1563 'base':269,798,875,930 'basemodel':2396,2399 'basic':387,890 'beat':134 'becom':608 'behavior':2189 'benchmark':3477 'best':754,768,945 'better':313,1713 'bias':1029 'bill':1802 'blind':3096 'boundari':109,1066,3668 'break':811,1244,1629,1761,1817,2052,2084,2256,2476,2709,2908,3082 'broad':2680,2743 'budget':2007 'buffer':2626,2636 'build':91,144,215,957,1429,1594,2133,2533 'builder':219,3488,3540 'built':2491 'burn':1846 'call':722,883,1902,1933,2565,2665,3420 'cannot':2377 'capabl':43,148,176,1099,1215,1691,2667,3447 'case':2040,2046,2080,2121 'caus':2456 'challeng':38 'chang':2867 'chatopenai':400,462,1031,1037 'check':1171,2327,2344,3210 'checkpoint':247,501,502,517,518,744,1216,1219,1239,1409,1419,1422,1432,1442,1443,1481,1494,1495,1763,1774,2662,3315,3324,3331,3333,3342 'choos':366 'cite':2367 'claim':2366,2400 'clarif':3662 'clariti':922 'class':1104,1721,1886,2141,2397,2546,2598,2941,3115 'claud':278 'clean':2733 'cleaner':594 'clear':108,3635 'client':2536 'close':1793 'code':1268,2577,2761 'coher':2935 'collabor':277,3452 'combin':1684 'compact':2961,2964,2974 'compani':2478 'complet':786,821,921,1243,1247,1475,2161,2224,2252,2304,3157 'complex':315,550,850,1460,1612,1622,2507 'compound':73,126,1589,2282 'compress':1985,3039 'comput':286,1830,3522,3525 'computer-use-ag':3524 'concern':597 'condit':985 'confid':2183,2248,2408,2415,2420 'config':480,527,537,709,729,741,1110,1448,1462,1471,1478,1516,1523,1534,1542,1547,1561,1574,1584 'config.get':1115,1122,1128,1133 'configur':528,710,1449 'confirm':2796 'conn':504,1424 'connect':2440 'consid':3396 'consist':2526 'constant':34 'constrain':102,129,1062 'constraint':1087 'consum':2914 'context':1307,1314,1319,1779,1789,1823,1827,1853,1856,1943,1947,2862,2876,2915,2922,2931,2939,3016,3120,3127 'contextmanag':2942 'continu':733,742,989,990,1000,1015,1479,1528,1539 'contradict':2891 'control':1306,3523 'convers':1788,1842 'coordin':3460 'core':2092 'correct':32,188,326,858,920,1059,3358,3602 'corrupt':175 'cost':1092,1114,1117,1142,1195,1197,1199,1203,1205,1207,1211,1226,1228,1249,1251,1305,1312,1316,1776,1819,1884,1889,1898,1907,1909,1914,1916,1918,1927,1937,1940,1998,3239,3251,3257,3260 'costlimitedag':1887 'costlimitexceed':1922 'costlimitexceedederror':1209 'could':2383 'count':1683,1811,1963,2968,3134 'cover':57 'crash':2457 'creat':393,454,458,493,510,617,623,633,682,723,1281 'creativ':852 'crewai':265 'criteria':867,919,1057,1058,3671 'critic':155,1072,1498,1592,1716,1782,2017 'critiqu':869,877,889,913,924,932,940,941 'cross':2326 'cross-check':2325 'curat':2059 'current':671,673,800 'danger':2673,2748,2793,2797,2815 'data':2215,2235,2288,2314,2431,2464,2512,2527,2568,2574,2686 'databas':2736 'datetime.now':2633 'day':1377 'db':3020 'debug':385,1551,3043,3077,3101 'decis':51,156,3088,3199 'decompos':12,23,570 'decomposit':65,185,745,748,3583 'decomposition-first':747 'def':776,892,956,998,1106,1137,1280,1317,1723,1894,1900,1931,1945,2143,2321,2564,2600,2609,2622,2809,2943,2954,2962,2973,3117,3139,3182 'defin':401,1255 'degrad':2186 'deleg':3453 'delet':2684,2741,2799 'demo':1604,2011,2026,2053,2101 'deploy':2198 'describ':2842,3639 'design':1718,2653,3490 'destruct':2728,3429,3443 'detail':3034 'detect':2410 'develop':3297 'didn':3058 'die':1465,2110,2436 'differ':1021,1043 'disadvantag':598 'discoveri':605 'doc':2760 'domain':104,132 'domain-specif':103,131 'done':810 'doubl':1313,1825 'dozen':3085 'draft':2773 'dramat':1619 'dri':2836 'drop':85,1618,2928 'dry-run':2835 'duplic':2652 'durabl':519,1357,1379,3335 'dynam':770 'e':1235,1749,1760,2169,2175,3165,3171 'earli':2631,2929 'earlier':2889 'ecosystem':283 'edg':979,981,986,1439,1586,2045,2079 'effort':946 'els':1535,2993 'email':2688,2769,2772,2774,2801 'end':257,994,1007,1013 'endpoint':2567,2572 'entir':2486 'entri':2297 'environ':3651 'environment-specif':3650 'error':74,127,697,1587,2173,2208,2502,2902,3169,3217,3244,3293,3407,3417,3422,3427 'escal':1096 'escap':2178 'estim':1194,1196,1204,1904,1906,1910,1915 'evalu':324,329,828,841,865,885,911,1017,1024,1036,1048,1053,1054,1056,3476,3551 'evaluator-optim':328,884 'evaluator.critique':914 'even':2723 'eventu':2525 'everi':49,124,163,1673,2364,2640,2858,2909,3196,3198 'everyth':112,162,751,1959,2085,2742,2857,2992,3014 'evid':2360,2380 'exact':1389 'exceed':1812,1925 'except':1232,1233,1746,1747,2166,2167,3162,3163 'execut':27,63,197,289,306,311,368,472,540,546,562,575,576,588,613,620,660,675,685,717,720,753,766,778,812,1138,1212,1358,1380,1724,2144,2428,2810,2833,2853 'executor':652,656,690,691 'executor.execute':817 'exhaust':2918 'exist':2337 'expand':147 'expens':2240,2296,2306,2323,2324 'expense.amount':2349,2351 'expense.restaurant':2332,2338 'expert':3656 'expir':2460,2615,2624 'expiri':2511 'explain':3076 'explicit':381,2791 'explod':1777 'explor':258,350 'exponenti':1590,1848,2545,2556 'extern':261,2329,2443,2641,2649,2664,3009 'extra':50 'f':1923 'fabric':2214,2279,2287,2295 'factual':2365 'fail':168,704,1606,1614,2015,2041,2172,2450,2524,3053,3065,3168 'failur':53,1363,1392,1634,1677,1720,2031,2468,2520,2656,3098 'fake':2236 'fals':1544 'feedback':942 'fewer':1689 'file':1267,1270,1272,2764,2766 'final':448,450 'find':2378 'first':94,721,749,1507,2135 'fix':1680,1881,2112,2316,2532,2750,2937,3107 'flag':2353,2417 'float':2409 'follow':413 'forget':2888 'format':414,647,2465,2513 'found':2343 'foundat':299 'founder':2037 'framework':234 'full':556,583,759,1554,1841,3033,3194 'fundament':1668 'gap':2097 'general':136 'general-purpos':135 'generat':861,901,966,967,982,991,1030,3130 'generator.generate':903 'generator.refine':934 'get':1150,1804,2610,2927 'given':631 'go':775,1562 'goal':24,64,184,378,571,728,779,784,785,788,1140,1157,1459,1461,1514,1515,2303,2714,2746,2897,3582 'goal-decomposit':183 'good':273,1773 'gpt':465,1034,1040 'gpt-4o':464,1033 'gpt-4o-mini':1039 'grace':2185 'grant':1278 'graph':959,960,1430,1433 'graph.add':964,969,974,980,984 'graph.compile':997,1441,1493 'great':1643 'ground':2319 'grow':1309,1787,1857 'growth':1780 'guardedag':1105 'guardrail':145,203,262,1060,1074,1103,2716 'guess':2389 'guesswork':3079 'gui':3528 'guidanc':36 'halt':2008 'handl':653,1393,2595,2651,3413,3423 'handler':3428 'happi':2056 'hard':1883 'hatch':2179 'haven':2489 'help':2262 'hierarch':3030 'high':2219,2438,2676 'histori':1555,1556,1560,1570,1986 'hours/days':1394 'human':35,150,249,593,705,735,1095,1183,1398,1483,1517,1525,1765,2180,2290,2423,2821,2847 'human-in-loop':248 'human-in-the-loop':149,1482 'hung':3286 'id':530,712,1451,2150,3339,3347,3353 'idea':3062 'idempot':2638,2645 'identifi':871 'immedi':2529 'implement':389,3492 'impli':3558,3564,3569,3574,3581,3587,3593,3599,3607,3613,3618,3624 'import':392,399,492,498,616,954,1051,1412,1416,2136,2395,2539,2930,3110,3179 'imposs':3105 'impress':2025 'improv':844,895 'includ':2130 'inconsist':2472 'increas':1623,1697 'increment':2199 'independ':11,22,1633,2660 'individu':654 'infinit':3236 'info':3389 'inform':2228 'init':1107,1895,2601,2944 'initi':862,900 'input':417,429,430,478,1083,2060,2064,2132,3200,3665 'inputs/outputs':3189 'insight':72 'instruct':2890 'int':2588 'integr':2432,2483,2497,2670 'interact':344,3531 'interleav':762,777 'intern':3087 'interpret':2738 'interrupt':715,1487,1496,1502,1511 'intervent':1399 'invent':2231,2313 'invoc':1445,1508,3279,3350 'invok':521 'isn':39 'issu':872,3069 'iter':482,843,898,910,949,1004 'json':1705,2573 'keep':1324,1952,2977,2984,3013 'key':71,380,2639,3036 'kill':76 'know':446,2193 'langchain':397 'langchain.agents':391 'langchain.evaluation':1050 'langgraph':235,487,610,950,1403,1408,1552,3614 'langgraph.checkpoint.postgres':497,1411 'langgraph.graph':953,1415 'langgraph.prebuilt':491,615 'langsmith':3174,3178 'last':1772,1955,2985 'latenc':3203 'later':1467 'layer':1081,1102 'learn':772 'least':1252,2751 'leav':2469 'len':1341 'length':1308,1824 'less':599 'lesson':98 'lifecycl':2597 'limit':475,1093,1669,1814,1885,1930,2455,2500,2905,3215,3224,3230,3242,3249,3261,3627 'list':626,649,1557 'll':2718 'llm':461,514,689,692,3255 'llms':2257 'load':1052,1055 'log':161,2855,2860,3109 'logger':2138,2140,3112,3114 'logger.bind':2148,3122,3144 'logger.error':2170,3166 'logger.info':2152,2159,3128,3149,3155 'long':1370,1453,1833,2882,2916,3272,3311,3502 'long-run':1369,1832,2881,3271,3310 'long-task':1452 'long-term':3501 'longer':1809 'look':2234 'loop':59,153,182,251,331,335,355,486,859,992,1486,3212,3227,3237,3595 'lose':1625,2893,2934 'lost':2932 'low':2419 'low-confid':2418 'made':2243 'made-up':2242 'make':41,46,384,2692,3084 'manag':241 'mani':1694 'match':2814,3636 'matter':559,849 'max':481,780,793,897,909,948,1116,1123,1321,1729,1736,1752,1888,1949,1966,2561,2946,2951,3232 'maximum':3222 'may':607 'mayb':2963 'mean':2062 'measur':2124 'medium':2879,3035,3050 'meet':2301 'memori':221,225,3010,3027,3496,3499,3504,3543 'memory-system':220 'memorysav':1421,3288,3294,3300 'mention':3556,3562,3567,3572,3579,3585,3591,3597,3605,3611,3616,3622 'messag':535,1320,1326,1329,1331,1334,1342,1345,1356,1879,1948,1957,1965,1969,1971,1974,1977,2910,2957,2959,2987,3002,3225,3252,3277,3299,3325,3348,3373,3395,3418,3442 'methodolog':2087 'mid':603,1401,2462,2504 'mid-process':1400 'mid-task':602,2461,2503 'middl':1337,1344,1350,1976,1981,1984,2994,2999,3003 'might':2737 'min':2559 'mini':1042 'minim':1256 'minut':2628 'mismatch':2466 'miss':3673 'mock':2448 'mode':2838,3099 'model':463,513,1022,1032,1038,1714,1813,2901 'modern':302 'modif':1581,3451 'modifi':1536,2696,2804,3437 'money':1847,2803 'monitor':2207 'month':2029 'move':2019 'multi':206,210,317,552,1101,1376,1596,3327,3458,3462,3546 'multi-ag':3457 'multi-agent-orchestr':209,3461,3545 'multi-agent-system':205 'multi-day':1375 'multi-lay':1100 'multi-step':316,551,1595,3326 'multipl':1079,3321,3465 'multipli':52,1676,2557 'must':165,1383,2404,3220 'mysteri':1615,3054 'n':1956,2986 'name':427,2238 'nativ':1407 'need':260,442,758,1078,1174,1339,3023,3275,3323,3344,3456,3470,3481,3494,3507,3521 'negoti':160 'never':2425 'next':363,796,802,805,808,818,823,1151,1156,3191 'node':963,965,968,970,973,975,978,1437,1491 'non':159 'non-negoti':158 'none':740,1477,1533,1546,1583,2604,2607 'note':242,259,272,285,298,312,327 'number':648,663 'oauth':2508 'object':632,727 'observ':372,434,2134,2911 'observableag':2142 'oct':245 'old':3038 'one':764,1696,2453 'open':256 'open-end':255 'openai':398 'oper':1073 'optim':330,886,2711,2747 'orchestr':212,229,3464,3548 'os.environ':506,1426 'output':115,139,851,863,902,912,917,918,928,933,938,939,944,976,977,995,1089,1704,2266,2371,2392,2407,2414,2421,3202,3361,3367,3375,3387,3391,3399,3645 'overkil':3516 'overwhelm':2047 'p95/p99':2125 'pars':3404 'partial':2467,2519,2655 'pass':925 'past':1568 'past_state.values':1575 'path':2057,2722,2726 'pattern':67,191,194,290,332,352,541,563,838,855,1359,2347,2816,3589 'paus':718,1488,1500,1505,1509,1530 'per':83,1258,1640,1699,1878,1890,1919,1928 'per-step':1698 'permiss':1257,1262,2681,2744,2754,2784,3666 'persist':526,3303,3357,3495 'perspect':1044 'phase':544,566 'plan':25,62,196,305,309,539,543,557,561,569,581,586,591,606,612,618,639,666,683,701,708,724,725,731,737,750,760,763,795,2850,3626 'plan-execut':61,195,304,538,560,611 'planner':622,627,687,688 'planner.plan':804 'planner.reassess':833 'plausibl':2233,2265 'plausible-look':2232 'point':1390,1531,1579,3037 'possibl':386,1687 'postgr':507,1427 'postgressav':499,1413,3305 'postgressaver.from':503,1423 'potenti':578 'prefer':1688 'prevent':484,3235,3263,3285 'previous':667,669,937,1566 'principl':120,1254,2753 'privileg':1253,2752 'probabl':54,128,1588,1635,1678 'problem':346,2093,2311,3072 'problem-solv':345 'process':1378,1402,2523 'produc':2264 'product':69,237,264,489,500,1070,1076,1373,1381,1418,1608,2014,2023,2033,2061,2105,2117,2492,2685,2805,3100,3291,3309 'project':2109 'promis':2479 'prompt':404,407,469,471,628,657,1954,2362,2926,2979,3577 'propos':117,141 'prototyp':2021 'provid':1405 'purchas':2693 'purpos':137 'pydant':2394,3400 'quadrat':1310,1821 'quadrupl':1828 'qualiti':848 'queri':479,536 'question':410,415,418,2089 'rais':1168,1208,1240,1755,1921,2176,2339,2591,3172 'rang':792,908,1735 'rare':1277,2780 'rate':75,82,1617,1639,2127,2209,2454,2499 'ratelimiterror':2592 'rather':2387 're':580,700,827,1838,3095 're-evalu':826 're-plan':579,699 're-send':1837 'react':60,193,291,333,351,354,388,394,403,406,459,470,488,494,511,3588 'react-pattern':192 'read':1266,1269,2757,2759,2763,2771,2787 'read/write':2783 'real':2001,2043,2095,2452,2496 'real-tim':2000 'realiz':1646 'reason':293,337,358,382,420 'receiv':373,2522 'recent':1328,1333,1354,1973,1990,1993,2988,3007,3032 'recipi':2691 'recommend':1679,1880,2111,2315,2531,2749,2936,3106 'recov':3415 'recover':2661 'recoveri':3408 'reduc':1681 'refer':2405 'refin':874,929 'reflect':66,190,320,837,854,891,893,951,958,971,972,983,987,3608 'reflection-pattern':189 'reflectionst':962 'refresh':2509,2630 'reject':1191 'relat':3532 'releas':244 'reliabl':48,70,93,121,200,1701,2073,2104,3394,3403 'remain':787,829,832 'repeat':376,439,767,879,2899 'replan':695 'replay':1576 'report':2241,2247,2307,3071 'reproduc':3068 'request':2820 'requir':1134,2359,2790,2795,3664 'research':1263,2755 'research/experimentation':254 'respons':1935,1941,2197,2569,2594 'response.headers.get':2581 'response.status':2576 'restart':1386 'restaur':2237,2336,2341 'result':374,435,476,533,668,670,738,816,825,1221,1512,1739,1743,1745,2156,2162,2163,2165,3153,3158,3159,3161,3383 'result.is':1242 'resum':1365,1387,1466,1770 'retri':1726,1730,1737,1753,1757,2540,2548,2579,2583,2589,2648 'retriev':3021 'retry-aft':2582 'return':835,927,943,996,1006,1012,1014,1189,1245,1301,1351,1355,1744,1968,1987,1991,2164,2593,2620,2632,2827,2830,3137,3160,3190 'revers':2870,3441 'review':730,1518,1766,2291,2355,2424,3657 'revok':2806 'robust':1019,2534 'robustag':1722 'robustapicli':2547 'role':268 'role-bas':267 'rollback':171,1098,1214,2666,3432,3446 'round':2350,2357 'run':1273,1371,1784,1808,1834,2118,2767,2837,2873,2883,3273,3312,3320 'runaway':485,3264 'runtim':1396 'safe':169 'safeti':1065,1080,3667 'satisfactori':881 'save':1446,3193,3448 'say':2272,2381 'scale':1820,2068,2115,2495 'schema':1706 'scope':130,204,1282,3638 'score':1010,2416 'screen':3530 'sdk':280 'search':1265 'second':96 'see':3024,3048 'segment':1764,1769 'self':31,187,323,840,857,888,1028,1108,1139,1727,1896,1903,1934,2145,2566,2602,2612,2625,2945,2956,2965,2975,3119,3141,3576,3601 'self-bia':1027 'self-correct':30,186,856,3600 'self-critiqu':887 'self-evalu':322,839 'self-prompt':3575 'self._execute':2157 'self.actual':1227 'self.agent':1111 'self.agent.execute':1223 'self.agent.plan':1155 'self.allowed':1126,1166 'self.calculate':1938 'self.client.post':2571 'self.compact':2972 'self.estimate':1198,1908 'self.expires':2605,2634 'self.is':2614 'self.llm.generate':3126 'self.log':1756 'self.max':1113,1120,1148,1206,1917,1926,2949,2969 'self.maybe':2960 'self.messages':2953,2981,2989,2995,3004 'self.messages.append':2958 'self.refresh':2618 'self.request':1182 'self.require':1131,1178 'self.rollback':1237 'self.save':1218 'self.token':2603,2616,2621,2967 'self.total':1897,1913,1936 'self.validate':1742 'send':1839,2687,2776,2800 'separ':308,542,595,1016,2782 'server':1385,1464 'set':1882,2697 'sever':1591,1781,2016,2218,2437,2675,2878,3049,3216,3243,3269,3292,3316,3340,3364,3388,3409,3433 'sharp':1585 'shortest':2721 'shouldn':2699 'show':2054 'silent':173 'similar':3176 'simpl':1609 'simpler':2188 'singl':1795 'situat':1593,1783,2018,2220,2439,2677,2880,3051 'skill':56,3029,3533,3630 'skill-autonomous-agents' 'slower':1805 'small':1695 'solv':347,2309 'sound':1642 'sourc':2330,2402 'source-sickn33' 'special':275,1047 'specif':105,133,1490,2369,3652 'spend':3265 'sqlitesav':3307 'stabl':756 'stack':2488 'stage':2212 'stakehold':2028 'stale':609 'start':1456,2154,2898,3151 'state':240,520,525,783,801,806,815,820,831,834,836,1001,1003,1009,1447,1468,1470,1519,1520,1522,1527,1537,1541,1559,1567,1569,1573,2473,3185,3192,3354,3438,3449 'state.is':1474 'stategraph':955,961,1417,1434 'statist':2245 'status':1190,1246 'step':84,89,125,297,318,340,474,553,636,638,641,651,655,661,662,672,674,677,703,765,781,790,794,1121,1124,1144,1147,1149,1230,1597,1631,1641,1648,1654,1660,1675,1682,1685,1692,1700,1711,1717,1728,1758,2658,3093,3123,3145,3184,3197,3206,3214,3223,3229,3233,3322,3328 'step-by-step':635 'step.execute':1740 'stop':2541,2549,2550,3658 'store':1553,3017 'str':2174,2401,2403,3170 'strategi':746 'stream':1995 'string':505,1425 'structlog':2137,3111 'structlog.get':2139,3113 'structur':1703,2391,3108,3386,3390,3398 'stuck':2217,2268 'substitut':3648 'subtask':573,577 'succeed':1611 'success':81,1616,1638,1650,1656,1662,2126,3670 'summar':1336,1349,1958,1983,2991,3001,3031 'summari':1348,1353,1982,1989,3000,3006,3040 'support':1796,2373,2379 'surpris':1800 'surviv':1362,1384 'suspici':2346,2356 'symptom':1600,1790,2024,2229,2445,2682,2886,3055 'system':8,19,208,222,226,1071,1325,1330,1352,1374,1953,1970,1988,1992,2049,2106,2361,2444,2470,2493,2650,2925,2978,2980,3005,3028,3500,3544 't.name':1298 'take':2672,2719 'takeov':2181 'task':319,554,604,625,713,757,771,853,896,904,915,916,935,936,1259,1261,1284,1288,1372,1454,1457,1610,1613,1621,1891,1920,1929,2146,2149,2153,2158,2160,2171,2225,2463,2505,2885,2917,3287,3519,3634 'task.id':2151 'task_permissions.get':1287 'team':271 'tech':2487 'templat':405 'tenac':2538 'term':3503 'test':1274,2113,2120,2768,2840,3479,3654 'test/evaluate':3472 'think':3118,3124 'thought':357,419,443,2913,3125,3129,3131,3132,3136,3138 'thought/action/observation':440 'thousand':1798 'thread':523,529,711,1450,3338,3346,3352 'threshold':2184 'ticket':1797 'time':1549,2002 'time-travel':1548 'timedelta':2627 'timeout':3268,3276,3281,3283 'togeth':3468 'token':1322,1810,1861,1866,1871,1876,1905,1911,1950,1964,1967,2459,2510,2611,2619,2904,2947,2950,2952,2970,3133,3135,3207 'tokenmanag':2599 'told':2731 'tool':28,214,218,233,288,348,370,426,433,467,468,515,516,680,693,694,1290,1296,1303,1304,2370,2406,3482,3487,3489,3539 'tool-build':213 'topic-agent-skills' 'topic-agentic-skills' 'topic-ai-agent-skills' 'topic-ai-agents' 'topic-ai-coding' 'topic-ai-workflows' 'topic-antigravity' 'topic-antigravity-skills' 'topic-claude-code' 'topic-claude-code-skills' 'topic-codex-cli' 'topic-codex-skills' 'total':1141,1202,1210,1225,1248,1250 'trace':383,3103,3180,3181,3187,3195 'tracedag':3116 'track':1997,2894,2938,3240,3247,3258 'traffic':2202 'train':2259,2412 'transfer':2802 'transform':1818 'travel':1550 'treat':113,138,3643 'tri':1220,1738,2155,2299,3152 'trigger':3454 'trim':1318,1855,1942,1946 'true':698 'truncat':2924 'trust':1626 'truth':119,143,2320 'try/catch':3425 'turn':1844,1858,1863,1868,1873 'two':565 'two-phas':564 'type':1260,1285,1289 'typo':2065 'uncertain':2430 'uncertainti':2411 'undon':2706 'unexpect':2063 'uniqu':3345 'updat':814 'uptim':2075 'url':508,1428 'usag':2940,3208 'usd':1118,1893 'use':287,343,349,411,549,678,847,1020,1046,1069,1368,1702,1712,1994,2039,2390,2637,2781,3008,3173,3254,3289,3304,3359,3372,3376,3397,3526,3554,3628 'user':531,1624,2044,2070,3070,3455,3469,3480,3493,3506,3520,3555,3561,3566,3571,3578,3584,3590,3596,3604,3610,3615,3621 'usual':2077 'v1':2516 'v2':2518 'valid':1084,1090,1158,1504,1708,2196,2317,2322,3209,3363,3370,3378,3379,3653 'validate/modify':590 'validationerror':2340 'variat':2514 'vector':3019 'verifi':2335,2385 'verifiedclaim':2398 'visibl':558,584,3090 'vs':2517 'wait':2544,2554,2555 'warn':3270,3317,3341,3365,3410,3434 'web':1264,2758 'webhook':2521 'well':3535 'window':2877,2920 'winner':100 'without':33,1854,2289,2694,2715,3089,3102,3213,3228,3256,3267,3280,3314,3330,3337,3351,3362,3377,3385,3406,3421,3431,3445 'won':3355 'work':830,1602,2012,2034,2078,2100,2446,3060,3467,3534 'workflow':228,231,3508,3511 'workflow-autom':230,3510 'workflow-orchestr':227 'would':1924,2845 'write':1271,2765,2789 'wrong':2253,2690","prices":[{"id":"6c372823-9577-4ee1-921d-8218da9ce89b","listingId":"bae8df4a-edb9-4a9d-b977-da3e04051486","amountUsd":"0","unit":"free","nativeCurrency":null,"nativeAmount":null,"chain":null,"payTo":null,"paymentMethod":"skill-free","isPrimary":true,"details":{"org":"sickn33","category":"antigravity-awesome-skills","install_from":"skills.sh"},"createdAt":"2026-04-18T20:37:20.303Z"}],"sources":[{"listingId":"bae8df4a-edb9-4a9d-b977-da3e04051486","source":"github","sourceId":"sickn33/antigravity-awesome-skills/autonomous-agents","sourceUrl":"https://github.com/sickn33/antigravity-awesome-skills/tree/main/skills/autonomous-agents","isPrimary":false,"firstSeenAt":"2026-04-18T21:31:38.048Z","lastSeenAt":"2026-05-18T18:50:31.723Z"},{"listingId":"bae8df4a-edb9-4a9d-b977-da3e04051486","source":"skills_sh","sourceId":"sickn33/antigravity-awesome-skills/autonomous-agents","sourceUrl":"https://skills.sh/sickn33/antigravity-awesome-skills/autonomous-agents","isPrimary":true,"firstSeenAt":"2026-04-18T20:37:20.303Z","lastSeenAt":"2026-05-07T22:40:45.532Z"}],"details":{"listingId":"bae8df4a-edb9-4a9d-b977-da3e04051486","quickStartSnippet":null,"exampleRequest":null,"exampleResponse":null,"schema":null,"openapiUrl":null,"agentsTxtUrl":null,"citations":[],"useCases":[],"bestFor":[],"notFor":[],"kindDetails":{"org":"sickn33","slug":"autonomous-agents","github":{"repo":"sickn33/antigravity-awesome-skills","stars":37911,"topics":["agent-skills","agentic-skills","ai-agent-skills","ai-agents","ai-coding","ai-workflows","antigravity","antigravity-skills","claude-code","claude-code-skills","codex-cli","codex-skills","cursor","cursor-skills","developer-tools","gemini-cli","gemini-skills","kiro","mcp","skill-library"],"license":"mit","html_url":"https://github.com/sickn33/antigravity-awesome-skills","pushed_at":"2026-05-18T08:24:49Z","description":"Installable GitHub library of 1,400+ agentic skills for Claude Code, Cursor, Codex CLI, Gemini CLI, Antigravity, and more. Includes installer CLI, bundles, workflows, and official/community skill collections.","skill_md_sha":"b9b9b3f78fac26edab21e2b577dd4fb89c4e57fc","skill_md_path":"skills/autonomous-agents/SKILL.md","default_branch":"main","skill_tree_url":"https://github.com/sickn33/antigravity-awesome-skills/tree/main/skills/autonomous-agents"},"layout":"multi","source":"github","category":"antigravity-awesome-skills","frontmatter":{"name":"autonomous-agents","description":"Autonomous agents are AI systems that can independently decompose"},"skills_sh_url":"https://skills.sh/sickn33/antigravity-awesome-skills/autonomous-agents"},"updatedAt":"2026-05-18T18:50:31.723Z"}}