Search

Showing top 132 results for "Gemma 4 local use"

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

127 articles indexed · Last updated 22h ago

Two days without internet taught me why local LLMs are worth the setup

…Local LLMs aren't really necessary, but they were in this case. Most of us can skip local LLMs and be completely fine without them. They're not actually necessary, but we…

Jun 15, 2026 · Nolen Jonker

Robotics Archives

…in industrial and robotic systems, NVIDIA Nemotron speech models are used for fast and accurate natural voice interactions. Qwen3 4B, served locally via vLLM, interprets requests and generates responses with low latency…

May 7, 2026

Google's excellent offline AI app just got even better with three big features

…10 PM,’ it schedules a local notification. When you tap that notification, the app opens directly to the right tool and starts a session with Gemma 4, ready to help. In other…

May 20, 2026 · Hadlee Simons

My self-hosted LLMs are a lot more than just a chat replacement – here's how they boost my productivity

…Local LLMs aren’t just chatbot replacements. Having spent hours configuring the right LLM pipelines for my FOSS models, I have to admit that local AI tools are far more useful than…

May 25, 2026 · Ayush Pande

Android Studio I/O Edition: What’s new in Android Developer tools

…Android Studio detects your subscription automatically when you log in with your Google account. Use your Google AI plan in Agent Mode Gemma 4 for local code assist and on-device AI…

The biggest local LLM on your machine is useless if it can't call a single tool, no matter how many parameters it has

…So what does this tell us? Well, your local AI agent doesn't need to be big. It just needs to be good at calling tools. Model size doesn't predict tool…

Jun 10, 2026 · Adam Conway

I replaced cloud LLMs with local models running off a Proxmox LXC, and the performance trade-off was worth it

…I can put my old graphics cards to good use. Like most LLM-hosting enthusiasts, I started my journey by hosting local models on Ollama, and it served me well for the…

May 29, 2026 · Ayush Pande

Discussions and forums

r/docker · u/CreativeCollege2815 · 2w ago

Using a Gemma4 Safetensor Already Downloaded Locally

Hi everyone. I need some help or advice. I’m learning how to use N8N, so I downloaded Docker and installed N8N locally. I also wanted to install Gemma4, which I use in ComfyUI to help with image generation prompts. Is it…

r/LocalLLaMA · u/gladkos · May 1, 2026

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM. Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

Hacker News · u/lostathome · 1w ago

Show HN: Hitoku Draft – Context aware local assistant

Hi guys. I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it already, and now it also has transcription with voice editing. Looking for feedback, as …

Hacker News · u/limondas · 5d ago

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

A Claude Code-like agentic workflow AI is too costly for me. Is there any LLM I can run with VS Code on the setup below? 16GB RAM, 13th-gen Intel Core i7 H-series processor, 512GB NVMe SSD. I want to run the AI as a local agentic workflow with VS Code. I w…

r/LocalLLaMA · u/gladkos · May 8, 2026

Top stories

I tested 3 local LLMs on my RTX 4070 Ti for real work — only one earned a permanent spot

I ran a powerful local AI model on my laptop, and it didn’t feel like a compromise

Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

I ran Gemma 4 and Qwen 3.5 for the same local tasks, and one pulled miles ahead

Discussions and forums

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

I ran this bulky LLM on an SBC cluster, and it's the most unhinged setup I've ever built

Open models stripped bare by safety-busting tools – Fudzilla.com

Running local AI on the Raspberry Pi 5 taught me why cloud models are still winning