DynoSim: Simulating the Pareto Frontier | NVIDIA Technical Blog
…A Router needs the current cache state and decode load. The Planner needs traffic, worker state, and SLA signals. KVBM needs transfer pressure, tier capacity, and future cache availability. Multi-engine simulation…
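
The per-component inputs listed above can be pictured as three read-only views over one shared simulation state. The following is a minimal sketch under stated assumptions, not DynoSim's actual data model: `SimSnapshot`, its field names, and the three `*_view` helpers are all hypothetical and exist only to illustrate which signals each component would read.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SimSnapshot:
    """Hypothetical state of one simulation step (illustrative names only)."""
    cached_blocks: dict[str, int]          # worker id -> KV blocks cached now
    decode_load: dict[str, int]            # worker id -> active decode requests
    request_rate: float                    # traffic, in requests per second
    active_workers: int                    # worker state, reduced to a count
    p99_latency_ms: float                  # observed latency (SLA signal)
    sla_latency_ms: float                  # latency target (SLA signal)
    transfer_queue_depth: int              # pending KV transfers (pressure)
    tier_free_blocks: dict[str, int]       # tier name -> free blocks
    projected_free_blocks: dict[str, int]  # tier name -> expected free blocks


def router_view(s: SimSnapshot) -> dict:
    # The Router reads the current cache state and decode load.
    return {"cached_blocks": s.cached_blocks, "decode_load": s.decode_load}


def planner_view(s: SimSnapshot) -> dict:
    # The Planner reads traffic, worker state, and SLA signals.
    return {
        "request_rate": s.request_rate,
        "active_workers": s.active_workers,
        "sla_violated": s.p99_latency_ms > s.sla_latency_ms,
    }


def kvbm_view(s: SimSnapshot) -> dict:
    # KVBM reads transfer pressure, tier capacity, and future availability.
    return {
        "transfer_queue_depth": s.transfer_queue_depth,
        "tier_free_blocks": s.tier_free_blocks,
        "projected_free_blocks": s.projected_free_blocks,
    }


# Made-up values, used only to exercise the three views.
snapshot = SimSnapshot(
    cached_blocks={"w0": 120, "w1": 40},
    decode_load={"w0": 6, "w1": 2},
    request_rate=35.0,
    active_workers=2,
    p99_latency_ms=480.0,
    sla_latency_ms=400.0,
    transfer_queue_depth=3,
    tier_free_blocks={"gpu": 10, "host": 500},
    projected_free_blocks={"gpu": 25, "host": 480},
)
```

Keeping the snapshot immutable means every component in a given step sees the same state, which is one simple way to keep the components' decisions comparable within a simulated step.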
