Search

Showing top 135 results for "Gemma 4 local use"

Related topics: Google Gemma

Tracked topic

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

112 articles indexed Last updated 1d ago See topic hub

Top stories

Discussions and forums

r/docker · u/CreativeCollege2815 · 1w ago

Using a Gemma4 Safetensor Already Downloaded Locally

Hi everyone. I need some help or advice. I’m learning how to use N8N, so I downloaded Docker and installed N8N locally. I also wanted to install Gemma4, which I use in ComfyUI to help with image generation prompts. Is it…

r/LocalLLaMA · u/gladkos · May 1, 2026

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

Hacker News · u/lostathome · 2d ago

Show HN: Hitoku Draft – Context aware local assistant

Hi guys.I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it already, and now it has also transcription with voice editing. Looking for feedback, as …

15 1
Hacker News · u/theodorehq · 2w ago

Show HN: Charm – on-device spelling, grammar, and prediction for macOS

I've spent the last year building Charm, a native macOS menu bar app that corrects spelling, fixes grammar, and predicts your next word.Three features:- Spells: NSSpellChecker plus a local LLM for context-aware correctio…

3 1
r/LocalLLaMA · u/gladkos · May 8, 2026

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

Implemented Multi-Token Prediction for LLaMA.cpp. Quantized Gemma 4 assistant models into GGUF format. Ran tests on a MacBook Pro M5Max. Gemma 26B with MTP drafts tokens 40% faster. Prompt: Write a Python program to find…