Agentic SRE Advanced By Samson Tanimawo, PhD Published Jun 30, 2026 5 min read

The Agent Cost Bomb: Pre-emptive Token Budgets

One stuck agent can burn $400 in an hour. The budget enforcement layer that stops it before it does, plus the alerting that wakes you up if budgets blow up across runs.

Three budget dimensions

Token budgets need three dimensions because the failure modes operate at different scopes. A single budget cap protects against one failure shape and misses the others.

Enforcement layer

Enforcement happens before the model call, not after. After-the-fact accounting is billing, not safety.

Alerting on budget excursions

Budget alerts route differently by scope. Per-run hits are individually noisy; aggregate hits are individually critical.

Calibrating the budgets

Budget calibration is empirical. Three steps move it from arbitrary to defensible.

Fail closed on budget exhaustion

When the budget runs out, the agent stops. Failing open invites runaway runs and the cost-bomb shape this whole pattern exists to prevent.