Google's latest trick gets Gemma 4 running 3x faster right on your phone
…How does MTP improve Gemma 4? The process uses a technique called “Speculative Decoding,” in which the drafter models predict upcoming words in the prompt even before the main Gemma model has…
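
To make the idea concrete, here is a minimal toy sketch of greedy speculative decoding in Python. It is not Gemma's or Google's implementation: the names `speculative_decode`, `target_next`, `draft_next`, and the draft length `k` are all hypothetical, and the two "models" are stand-in functions over integers. It assumes the simplest acceptance rule (keep a drafted token only if the main model would have picked the same one), whereas production systems typically use a probabilistic acceptance test and verify all drafted positions in one batched forward pass.

```python
from typing import Callable, List, Tuple

Token = int
NextToken = Callable[[List[Token]], Token]  # hypothetical "model": context -> next token


def greedy_decode(target_next: NextToken, prompt: List[Token], n: int) -> List[Token]:
    """Baseline: the main model generates one token per pass."""
    out = list(prompt)
    for _ in range(n):
        out.append(target_next(out))
    return out


def speculative_decode(
    target_next: NextToken,
    draft_next: NextToken,
    prompt: List[Token],
    n: int,
    k: int = 4,
) -> Tuple[List[Token], int]:
    """Toy greedy speculative decoding; returns (tokens, verification rounds)."""
    out = list(prompt)
    rounds = 0
    while len(out) - len(prompt) < n:
        # 1. The cheap drafter guesses k tokens ahead.
        draft: List[Token] = []
        for _ in range(k):
            draft.append(draft_next(out + draft))

        # 2. The main model checks the guesses. Here that is a Python loop;
        #    a real system would score all k positions in one forward pass.
        rounds += 1
        accepted: List[Token] = []
        for tok in draft:
            expected = target_next(out + accepted)
            if tok == expected:
                accepted.append(tok)
            else:
                accepted.append(expected)  # replace the first wrong guess and stop
                break
        else:
            # Every guess matched, so the main model's own next token comes for free.
            accepted.append(target_next(out + accepted))
        out.extend(accepted)
    return out[: len(prompt) + n], rounds


# Stand-in models: the target counts upward mod 10; the drafter agrees
# except after a 7, where it guesses wrong.
def target_next(ctx: List[Token]) -> Token:
    return (ctx[-1] + 1) % 10


def draft_next(ctx: List[Token]) -> Token:
    return 0 if ctx[-1] == 7 else (ctx[-1] + 1) % 10


tokens, rounds = speculative_decode(target_next, draft_next, [0], n=12, k=4)
baseline = greedy_decode(target_next, [0], n=12)
print(tokens == baseline, rounds)
```

In this sketch the output is identical to what the main model would produce alone, but it needs fewer verification rounds than tokens generated. That is where the speedup comes from when the drafter is much cheaper than the main model and its guesses are usually right.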
