newsletter.semianalysis.com › p Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack … Serving an LLM request involves two phases: the prefill phase and decode phase. In the prefill phase, the LLM generates the first token from the user prompt. … Sep 10, 2025 · Dylan Patel