Day 0 Support for Gemma 4 on AMD Processors and GPUs
… Conclusion With Gemma 4, Google continues to improve the state-of-the-art in compact open-weights models. …
Tracked topic
Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.
The Gemma 4 family of open-weights models from Google includes four variants, spanning a range of sizes from 2B effective parameters to 31B parameters and including both Mixture of Experts (MoE) and dense architectures. These multimodal models ingest text, vision, and for select variants, audio inputs and generate text outputs. They support context sizes of up to 256K tokens, and have been trained for thinking, coding, function calling, optical character recognition (OCR), object recognition and automatic speech recognition tasks. For relatively compact models they have outstanding language s
Day 0 Support for Gemma 4 on AMD Processors and GPUs… Conclusion With Gemma 4, Google continues to improve the state-of-the-art in compact open-weights models. …
… Figure 2 PACE/vLLM Speedup demonstrates these gains across input lengths 128, 1920 , output lengths 128, 1920 , and batch sizes 1, 32, 128 over a diverse set of models spanning multiple architectures and scales : Phi-4, OPT-6.7B, Llama-3.1-8B, Llama-3.2-3B, Qwen2-7B, Gemma-3-4B, and Gemma-3-12B . …