Agentic AI Systems Should Be Designed as Marginal Token Allocators
… But adopting marginal token allocation as the shared accounting object explains why systems that locally minimize tokens globally misallocate them, predicts a small set of recurring failure modes (over-routing, over-delegation, under-verification, serving congestion, stale rollouts, cache misuse), a… …