Search

Showing top 139 results for "Gemma 4 releases"

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

144 articles indexed Last updated 3d ago See topic hub

AMD PACE - A vLLM Plugin for CPU Inference

…Support for MOE is under active development and would be part of subsequent release. The four components in this first release are: Plugin infrastructure - registering PACE as a vLLM plugin. SLAB backend…

Jun 11, 2026 · Arjun Muraleedharan

007 First Light Is Fixing the Stealth Genre's Biggest Problem With One Feature

…The supporting cast includes Lenny Kravitz, Gemma Chan, Lennie James, Priyanga Burford, and Alastair Mackenzie. Instinct Being a Manageable Resource Adds to the Challenge Bond won't be able to rely on…

May 18, 2026 · LB Beistad

Google's TurboQuant cuts AI working memory by 6x, but it won't fix the global RAM shortage

…including LongBench, Needle in a Haystack, ZeroSCROLLS, RULER, and L-Eval, using the open-source Gemma and Mistral LLMs. The results show that TurboQuant could make AI cheaper to run, reducing its…

Mar 27, 2026 · Hassam Nasir

I ditched Claude for Obsidian and a local LLM, and miss it less than I expected to

…Now, I've been running better models such as Qwen 3.5 9B and Gemma 4 E4B , and have also tested various local LLM runners beyond LM Studio. But I still did…

Jun 1, 2026 · Nolen Jonker

Discussions and forums

r/LocalLLaMA · u/rerri · May 5, 2026

Gemma 4 MTP released

Blog post: https://blog.google/innovation-and-ai/technology/developers-tools/multi-token-prediction-gemma-4/ MTP draft models: https://huggingface.co/google/gemma-4-31B-it-assistant https://huggingface.co/google/gemma-4-…

r/LocalLLaMA · u/jacek2023 · 4w ago

google/gemma-4-12B · Hugging Face

Gemma is a family of open models built by Google DeepMind. Gemma 4 models are multimodal, handling text and image input (with audio supported on E2B, E4B, and 12B) and generating text output. This release includes open-w…

Hacker News · u/ericlbuehler · 2w ago

Show HN: Run Agent Skills with mistral.rs v0.8.10: /v1/skills support and more

Hey all! I'm the maintainer of mistral.rs. I just landed support for OpenAI-compatible Agent Skills via a /v1/skills endpoint, and it works with local open models.Until now Skills have basically been locked to closed mod…

20 2

r/LocalLLaMA · u/The_Paradoxy · May 11, 2026

The Qwen 3.6 35B A3B hype is real!!!

My personal test for small local LLM intelligence is to check whether a model has any ability to understand the code that I write for my own academic research. My research is on some pretty niche topics and I doubt that …

r/LocalLLaMA · u/oobabooga4 · May 13, 2026

TextGen is now a native desktop app. Open-source alternative to LM Studio (formerly text-generation-webui).

Hi all, I have been making a lot of updates to my project, and I wanted to share them here. TextGen (previously text-generation-webui, also known as my username oobabooga or ooba) has been in development since December 2…

Your old gaming PC is overkill for a home server, and that's exactly why it's perfect

…Heck, I’ve deployed a Gemma-4-26B-A4B on a GTX 1080 – a decade-old GPU that’s hooked up to my ancient first-gen Ryzen rig. Considering my token-generation…

May 30, 2026 · Ayush Pande

You don't need an expensive GPU to run a local LLM that actually works

…Because Meta releases the weights openly, the community has built countless quantized versions optimized for consumer hardware. The correct answer is Llama. While Falcon, Gemma (Google), and Mistral are all legitimate open…

Apr 29, 2026 · Rich Edmonds

ASUS AI POD with NVIDIA Vera Rubin NVL72 | Liquid-Cooled AI

ASUS Unveils Game-Changing Liquid-Cooled AI Infrastructure Powered by NVIDIA Vera Rubin Platform March 17, 2026 ・ Press Release Delivering trusted AI with total flexibility, from rack-scale AI factories to edge…

Mar 17, 2026

007 First Light: 5 Things You Need To Know

…The game is coming to various platforms on May 27 (the Switch 2 version will arrive later ), shaping up as one of the biggest action releases of the year. Based on hands…

May 12, 2026 · Vlad Mazanko

The MCU's Phase 5 Was Secretly Brilliant (Despite Being Disjointed)

…Gone are the days of every release being certified fresh on Rotten Tomatoes. Phase 4 marked the start of the Multiverse Saga , which had a lot to live up to after the…

May 25, 2026 · Kevin Pantoja

To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.

Followed topics