Google's latest DiffusionGemma open AI model comes with a 4x speed boost
…This approach to text generation produces up to 256 tokens in parallel, which shifts the bottleneck from memory bandwidth to compute. Google says this offers a measurable boost in non-linear tasks like…
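To make the shift from memory bandwidth to compute concrete, here is a rough back-of-the-envelope sketch. It is not Google's method or any published benchmark: the parameter count, weight precision, and hardware figures are hypothetical placeholders, and the model is simplified to a single forward pass that reads every weight once and spends about 2 FLOPs per parameter per token. It ignores attention cost, caching, and the number of denoising passes a diffusion model would need.

```python
# Hypothetical numbers for illustration only; none come from the article.
PARAMS = 4e9            # assumed parameter count
BYTES_PER_PARAM = 2     # assumed 16-bit weights
MEM_BANDWIDTH = 1e12    # assumed accelerator memory bandwidth, bytes/second
PEAK_FLOPS = 2e14       # assumed accelerator peak throughput, FLOP/second


def pass_times(tokens_per_pass: int) -> tuple[float, float]:
    """Return (memory_time, compute_time) in seconds for one forward pass."""
    # Weights are streamed from memory once per pass, however many tokens it handles.
    memory_time = PARAMS * BYTES_PER_PARAM / MEM_BANDWIDTH
    # Roughly 2 FLOPs (multiply + add) per parameter for each token processed.
    compute_time = 2 * PARAMS * tokens_per_pass / PEAK_FLOPS
    return memory_time, compute_time


def bottleneck(tokens_per_pass: int) -> str:
    """Name the resource that dominates the time of one forward pass."""
    memory_time, compute_time = pass_times(tokens_per_pass)
    return "memory" if memory_time >= compute_time else "compute"


if __name__ == "__main__":
    for n in (1, 256):
        mem, comp = pass_times(n)
        print(f"{n:>3} tokens/pass: memory {mem * 1e3:.2f} ms, "
              f"compute {comp * 1e3:.2f} ms, bound by {bottleneck(n)}")
```

Under these assumed figures, a pass that emits one token spends nearly all its time waiting on weight reads, while a pass that handles 256 tokens amortizes the same reads over enough arithmetic that compute becomes the limit. Where the crossover falls on real hardware depends on details this sketch leaves out.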
