Track coding-agent quota burn and remaining headroom across providers with onWatch
Monitor quota, spend, resets, and alerts across multiple coding-agent providers from one local dashboard before a long run hits throttling or budget limits.
What it does
Track coding-agent quota burn and remaining headroom across providers with onWatch
Monitor quota, spend, resets, and alerts across multiple coding-agent providers from one local dashboard before a long run hits throttling or budget limits.
Prerequisites
onWatch binary or Homebrew install, local shell access, provider credentials for one or more supported services, and optional browser access for the local dashboard
Installation
Use the upstream install or setup path that matches your environment:
- docker run -d --name onwatch -p 9211:9211 \
- git clone https://github.com/onllm-dev/onwatch.git && cd onwatch
- docker-compose up -d
- docker-compose logs -f
Requirements and caveats from upstream:
- | GEMINI_REFRESH_TOKEN | Gemini OAuth refresh token (for Docker/headless) |
- | GEMINI_ACCESS_TOKEN | Gemini OAuth access token (for Docker/headless) |
- | ANTIGRAVITY_BASE_URL | Antigravity base URL (for Docker/manual config) |
Basic usage or getting-started notes:
-
Track usage across Synthetic, Z.ai, Anthropic, Codex, GitHub Copilot, [MiniMax](http...
-
See history, get alerts, and open a local web dashboard before you hit throttling or run over budget. Additionally, you can ingest local telemetry from your own API-driven workflows with API Integrations, keeping trac...
-
onWatch fills the gap between "current usage snapshot" and the historical, per-cycle, cross-session view that developers actually need. It runs as a lightweight background agent (<50 MB RAM with all nine providers pol...
-
Extracted from upstream docs: https://raw.githubusercontent.com/onllm-dev/onWatch/HEAD/README.md
Documentation
Source
Capabilities
Install
Quality
deterministic score 0.45 from registry signals: · indexed on github topic:agent-skills · 8 github stars · SKILL.md body (2,009 chars)