Showing top 135 results for "Gemma releases"

Boost Inference Performance up to 15x on NVIDIA Blackwell Using DFlash Speculative Decoding | NVIDIA Technical Blog

…paper’s release, the research team has released 20 DFlash model checkpoints on Hugging Face with Blackwell and Hopper recipes, covering model families including Qwen, Kimi K2.6, Llama, Gemma, and gpt…

Jun 23, 2026 · Amr Elmeleegy

I replaced the expensive Claude Pro subscription with these local models, and my productivity didn’t drop a bit

…Gemma 4 E4B The ‘everyday’ champion If Qwen3.6 is my heavy-duty engineer, Gemma 4 E4B is my nimble, everyday champion. When it comes to local LLMs, we often think that…

Apr 21, 2026 · Parth Shah

How to run a local AI chatbot on your iPhone - Engadget

…First, the current pace at which companies like OpenAI are releasing new models means those systems inherently incorporate more recent data since they're newer. Moreover, since you need an internet connection…

May 28, 2026 · Igor Bonifacic

Android Studio I/O Edition: What’s new in Android Developer tools

…This integration supports uploading an initial release of a brand-new app to Play Console’s internal test track. You can also use this feature to upload releases to existing apps to…

Discussions and forums

r/LocalLLaMA · u/rerri · May 5, 2026

Gemma 4 MTP released

Blog post: https://blog.google/innovation-and-ai/technology/developers-tools/multi-token-prediction-gemma-4/ MTP draft models: https://huggingface.co/google/gemma-4-31B-it-assistant https://huggingface.co/google/gemma-4-…

r/LocalLLaMA · u/jacek2023 · 3w ago

google/gemma-4-12B · Hugging Face

Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on E2B, E4B, and 12B) and generating text output. This release includes open-w…

Hacker News · u/ericlbuehler · 1w ago

Show HN: Run Agent Skills with mistral.rs v0.8.10: /v1/skills support and more

Hey all! I'm the maintainer of mistral.rs. I just landed support for OpenAI-compatible Agent Skills via a /v1/skills endpoint, and it works with local open models. Until now Skills have basically been locked to closed mod…

r/LocalLLaMA · u/The_Paradoxy · May 11, 2026

The Qwen 3.6 35B A3B hype is real!!!

My personal test for small local LLM intelligence is to check whether a model has any ability to understand the code that I write for my own academic research. My research is on some pretty niche topics and I doubt that …

r/LocalLLaMA · u/oobabooga4 · May 13, 2026

TextGen is now a native desktop app. Open-source alternative to LM Studio (formerly text-generation-webui).

Hi all, I have been making a lot of updates to my project, and I wanted to share them here. TextGen (previously text-generation-webui, also known as my username oobabooga or ooba) has been in development since December 2…

Sesame's AI voice app is the best I've tested. That's what worries me

…Enter Sesame, which has been working on its own voice AI system for more than a year—my colleague Mark Hachman tried an earlier incarnation last February—and has just released a…

Jun 3, 2026 · By Ben Patterson

Hermes Unlocks Self-Improving AI Agents, Powered by NVIDIA RTX PCs and DGX Spark

…Get the real-time responsiveness needed for local AI, where agents can tackle multistep tasks and refine their skills to keep workflows seamless. Google’s Gemma 4 26B and 31B models now…

May 13, 2026 · Abhishek Gore

Embedded AI Archives

…Built on Google’s Gemini research, Gemma 3 is a versatile workhorse for Jetson. It is multimodal out of the box, which means it can see and talk in over 140 languages…

May 7, 2026

Generative AI Archives

…Built on Google’s Gemini research, Gemma 3 is a versatile workhorse for Jetson. It is multimodal out of the box, which means it can see and talk in over 140 languages…

May 7, 2026

GTC 2026 Archives

…Built on Google’s Gemini research, Gemma 3 is a versatile workhorse for Jetson. It is multimodal out of the box, which means it can see and talk in over 140 languages…

May 7, 2026

NVIDIA Cosmos Archives

…Built on Google’s Gemini research, Gemma 3 is a versatile workhorse for Jetson. It is multimodal out of the box, which means it can see and talk in over 140 languages…

May 7, 2026

Top stories

I split my coding work between Claude, Qwen3-Coder and Gemma 4, and it costs less than paying for one subscription

I tried Google's new DiffusionGemma, and watching it generate text like an image is unlike any local LLM
