AMD PACE - High-Performance Platform Aware Compute Engine
…Scheduling and padding strategies are tuned to sustain high throughput across varying sequence lengths and batch sizes. Speculative decoding: Multi-token decode is optimized for AMD Parallel Draft (PARD) speculative decoding ( see…