The problem: AI agents make API calls that cost money — LLM inference, tool calls, third-party services. Most setups have no hard spending limits. An agent loop or prompt injection can burn through hundreds of dollars before anyone notices. Rate limiting doesn't help because it doesn't understand cost.
SatGate sits in front of your agent's outbound calls and enforces economic policy:
• Hard budget caps — per-agent, per-tool, per-time-window. Not alerts, actual enforcement. The call gets rejected.
• Per-tool cost attribution — MCP-aware. Knows which tool in a chain caused what spend. Not just "1,000 requests" but "Agent X spent $47 on search_database and $12 on send_email."
• Macaroon capability tokens — cryptographic credentials with built-in caveats (budget, time window, allowed tools). Agents can sub-delegate scoped tokens without calling home. Not API keys.
• L402 Lightning micropayments — agents can pay for API access per-call using HTTP 402. Sub-cent pricing that doesn't work on card rails.
It's not a routing gateway. LiteLLM and Bifrost solve which provider handles a request. SatGate decides whether the request should happen at all given your budget constraints. They're complementary — SatGate sits in front of a routing gateway.
What it doesn't do: It doesn't optimize costs, negotiate rates, or pick cheaper providers. It's a policy enforcement layer, not an optimizer.
Single binary, 60-second quickstart, <50ms overhead.
GitHub: https://github.com/SatGate-io/satgate Blog: https://satgate.io/blog/why-routing-isnt-governance