Skip to main content
institutional access

You are connecting from
Lake Geneva Public Library,
please login or register to take advantage of your institution's Ground News Plan.

Published loading...Updated

Google's Latest DiffusionGemma Open AI Model Comes with a 4x Speed Boost

Google says the experimental model matches other Gemma systems and runs about 4 times faster, with weights available on Hugging Face.

  • Google released DiffusionGemma, an experimental model that delivers faster text generation than previous Gemma versions through simultaneous token prediction.
  • Nvidia and Google collaborated to ensure optimization across diverse hardware setups, including enterprise systems like the H100 and DGX Spark, plus quantized RTX GPUs with efficient HBM.
  • Model weights are available for download from Hugging Face under the same Apache 2.0 license as other fourth-generation Gemma models, enabling broad developer access.
  • Google recently implemented Multi-Token Prediction drafters to utilize idle compute cycles; diffusion, however, is even faster than the MTP versions of Gemma.
  • Diffusion models offer efficient compute usage, but face drawbacks in text generation; because language is discrete, errors can render tokens meaningless and force users to restart.
Insights by Ground AI

22 Articles

Google has released an experimental open-source model, DiffusionGemma, which radically changes the traditional approach to text generation. Unlike standard models like Gemma 4, which write strictly sequentially—word by word—the new model generates an entire text array at once as a random set of "noisy" tokens, and then, in several passes, cleans and edits it until it is readable. Essentially, while conventional AI models write text sequentially,…

Google has presented without too much noise its new AI DiffusionGemma, an open model that changes the way to generate text to prioritize speed in local GPU, even accepting a loss of quality compared to Gemma 4. The movement of the great G fits better within the pulse of open models that are pushing from China, with Qwen or DeepSeek as unavoidable references, rather than within a direct comparison with GPT or Claude, where the battle is fought wi…

Think freely.Subscribe and get full access to Ground NewsSubscriptions start at $9.99/yearSubscribe

Bias Distribution

  • 100% of the sources are Center
100% Center

Factuality Info Icon

To view factuality data please Upgrade to Premium

Ownership

Info Icon

To view ownership data please Upgrade to Vantage

deepmind.google broke the news on Wednesday, June 10, 2026.
Too Big Arrow Icon
Sources are mostly out of (0)

Similar News Topics

News
Feed Dots Icon
For You
Search Icon
Search
Blindspot LogoBlindspotLocal