Showing top 135 results for "Gemma 4 local use"

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

112 articles indexed · Last updated 1d ago

My local LLM rewrote my resume better than ChatGPT, and it's not even close

…Simultaneously, I uploaded my resume to my local LLM, which runs the Gemma 4 model through Ollama. When I have plenty of time on my hands, I usually prefer using the Quen…

May 7, 2026 · Samarveer Singh

Google Gemma 4 in your pocket: How to run the latest AI fully offline

…Google's new AI Edge Gallery brings local Gemma 4…

Apr 9, 2026 · Bryan Wolfe

Google's latest trick gets Gemma 4 running 3x faster right on your phone

…This allows the model to use the memory and the compute more efficiently. Google’s recently launched Gemma 4 edge AI models are especially designed to run locally on consumer-hosted hardware…

May 6, 2026 · Tushar Mehta

Google's Gemma 4 isn't the smartest local LLM I've run, but it's the one I reach for most

…Most people who have used local models have probably dealt with the experience of open-weight models struggling to call tools consistently and effectively. Gemma 4 handles it reliably, as Google baked…

Apr 15, 2026 · Adam Conway

I tested 3 local LLMs on my actual work — and each model won at something different

…Gemma 4 is also one of the top models recommended for mobile use for this reason…

Apr 21, 2026 · Nolen Jonker

I stopped hitting Claude's message limit by building a local AI pipeline that does the heavy lifting

…Local-first vibe coding: Gemma 4 writes the code and Claude makes…

May 14, 2026 · Abhinav Raj

I turned my phone into a local LLM server, and it handles vision, voice, and tool calls

…The phone is doing all the thinking. All local using Termux. Setup on the phone itself is surprisingly straightforward once you know what you're doing. I installed Termux from F-Droid…

Apr 21, 2026 · Adam Conway

Discussions and forums

r/docker · u/CreativeCollege2815 · 1w ago

Using a Gemma4 Safetensor Already Downloaded Locally

Hi everyone. I need some help or advice. I’m learning how to use N8N, so I downloaded Docker and installed N8N locally. I also wanted to install Gemma4, which I use in ComfyUI to help with image generation prompts. Is it…

r/LocalLLaMA · u/gladkos · May 1, 2026

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM. Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

Hacker News · u/lostathome · 2d ago

Show HN: Hitoku Draft – Context aware local assistant

Hi guys. I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it already, and now it also has transcription with voice editing. Looking for feedback, as …

Hacker News · u/theodorehq · 2w ago

Show HN: Charm – on-device spelling, grammar, and prediction for macOS

I've spent the last year building Charm, a native macOS menu bar app that corrects spelling, fixes grammar, and predicts your next word. Three features: Spells: NSSpellChecker plus a local LLM for context-aware correctio…

r/LocalLLaMA · u/gladkos · May 8, 2026

Top stories

Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

I ran Gemma 4 and Qwen 3.5 for the same local tasks, and one pulled miles ahead

I put Google's Gemma 4 on my homelab and Tailscale on my phone — and cancelled Claude Pro

Google's free Gemma 4 model runs on hardware you probably already own

Discussions and forums

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

Lemonade 10.1 Released For Latest Improvements For Local LLMs On AMD GPUs & NPUs

A Modder Repurposed a Used V100 For LLM Acceleration

I ran local AI models on a six-year-old laptop with no GPU, and they actually worked