Search

Showing top 133 results for "Gemma 4 local use"

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

127 articles indexed · Last updated 1d ago

Running local AI on the Raspberry Pi 5 taught me why cloud models are still winning

…Llama, Gemma, and Deepseek. I opted to install the smallest version of each model, as I only had 32GB of space on the SD Card. Using LLMs on the SBC was responsive…

Apr 30, 2026 · Charles Wolfe

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

…File "/usr/local/lib/python3.11/dist-packages/transformers/models/gemma3/modeling_gemma3.py", line 880, in forward [rank0]: logits = self.lm_head(hidden_states[:, slice_indices, :]) [rank0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [rank0]: File "/usr/local…

Feb 5, 2024 · Mert Toslali

I ran local LLMs on Intel's cheapest iGPU, and the results were surprisingly decent

…It ain’t perfect, but it’s a decent secondary LLM server I’ve got a Gemma4-26B-A4B instance that runs on my GTX 1080 24/7, and I use it…

May 31, 2026 · Ayush Pande

Google's $100 AI Ultra is still overpriced, and here's what I use instead

…Qwen See at Github Replacing the AI is only the first step I found a replacement for everything I use Adding a local LLM is only part of the equation, because Google…

Jun 5, 2026 · Joe Rice-Jones

I replaced Cursor and Antigravity with a completely local VS Code setup, and I missed less than I expected

…I already use one workstation for my Proxmox experiments, while the other is my main gaming/video-editing/coding machine. Then there’s the privacy advantage of hooking local LLMs up to…

Jun 3, 2026 · Ayush Pande

Google Quietly Launches Free Offline Dictation App for iOS

…performing speech recognition tasks locally. This means users can record and transcribe audio in places with poor or no network access. The app uses Google's Gemma-based speech models to convert…

Apr 7, 2026 · Devesh Beri

Local LLMs are actually good now, and I wasted months not realizing it

…Gemma 4 isn't the smartest local LLM I've run, but it's the one I reach for most Google's newest Gemma 4 models are both powerful and useful. The…

Apr 18, 2026 · Nolen Jonker

Discussions and forums

r/docker · u/CreativeCollege2815 · 3w ago

Using a Gemma4 Safetensor Already Downloaded Locally

Hi everyone. I need some help or advice. I’m learning how to use N8N, so I downloaded Docker and installed N8N locally. I also wanted to install Gemma4, which I use in ComfyUI to help with image generation prompts. Is it…

r/LocalLLaMA · u/gladkos · May 1, 2026

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

Hacker News · u/lostathome · 1w ago

Show HN: Hitoku Draft – Context aware local assistant

Hi guys. I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it already, and now it has also transcription with voice editing. Looking for feedback, as …

Hacker News · u/limondas · 6d ago

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

Claude Code like agentic workflow ai too costly for me. Any LLM can I run with VSCode at the below setup? 16ram Intel core i7 h processor 13gen 512gb NVMe SSD I want to run the ai as local agentic workflow with Vscode. I w…

r/LocalLLaMA · u/gladkos · May 8, 2026

Top stories

I tested 3 local LLMs on my RTX 4070 Ti for real work — only one earned a permanent spot

I ran a powerful local AI model on my laptop, and it didn’t feel like a compromise

Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

I ran Gemma 4 and Qwen 3.5 for the same local tasks, and one pulled miles ahead

Discussions and forums

Multi-Token Prediction (MTP) for LLaMA.cpp - Gemma 4 speedup by 40%

Updating Classifier Evasion for Vision Language Models | NVIDIA Technical Blog

New Adobe Premiere Color Grading Mode Accelerated on NVIDIA GPUs

LM Studio now lets you use your iPhone to talk to local models on your Mac - 9to5Mac