Search

Showing top 126 results for "Gemma 4 local use"

Related topics: Google Gemma

Tracked topic

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

102 articles indexed Last updated 13h ago See topic hub
tomshardware.com › tech-industry › artificial-intelligence

Maker packs an opinionated, googly-eyed AI chatbot into a mobile suitcase, powered by an Nvidia Jetson — entirely local machine entity runs Gemma 4 E4B and can respond in 200ms

Gemma 4 E4B, ~200ms cached TTFT, 30+ sensors, no WiFi/BT/cellular. He has opinions. from r/LocalLLaMA In the r/LocalLLaMA subreddit, CreativelyBankrupt outlines the ‘recipe’ for this characterful digital companion. “Sparky runs entirely on the Jetson. …

May 17, 2026 · Mark Tyson

Top stories

Discussions and forums

r/docker · u/CreativeCollege2815 · 1w ago

Using a Gemma4 Safetensor Already Downloaded Locally

Hi everyone. I need some help or advice. I’m learning how to use N8N, so I downloaded Docker and installed N8N locally. I also wanted to install Gemma4, which I use in ComfyUI to help with image generation prompts. Is it…

r/LocalLLaMA · u/gladkos · May 1, 2026

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

Hacker News · u/theodorehq · 1w ago

Show HN: Charm – on-device spelling, grammar, and prediction for macOS

I've spent the last year building Charm, a native macOS menu bar app that corrects spelling, fixes grammar, and predicts your next word.Three features:- Spells: NSSpellChecker plus a local LLM for context-aware correctio…

3 1
r/LocalLLaMA · u/gladkos · 3w ago

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

Implemented Multi-Token Prediction for LLaMA.cpp. Quantized Gemma 4 assistant models into GGUF format. Ran tests on a MacBook Pro M5Max. Gemma 26B with MTP drafts tokens 40% faster. Prompt: Write a Python program to find…

r/LocalLLaMA · u/TumbleweedNew6515 · 1w ago

Update on 12x32gb sxm v100 cluster / local AI for legal drafting

Update from the lawyer with the V100 server. A few of you asked what I actually ended up running once the dust settled, so here it is. Still just a lawyer, still driving the whole thing through Claude Code, still not ful…