Search

Showing top 135 results for "Gemma 4 local run"

Google Gemma

Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.

123 articles indexed Last updated 2d ago See topic hub

AMD's GAIA Defaults To Better Model, Continued Improvements For Local AI

…get local AI agents running on your AMD hardware whether it's on Windows or Linux. With GAIA 0.17.5, they have replaced Qwen 3.5 35B with Gemma 4 E4B…

May 2, 2026

Google's latest DiffusionGemma open AI model comes with a 4x speed boost

…Google says this makes it faster and more efficient when running on local hardware like an Nvidia DGX or a humble gaming GPU. Most AI models are designed to be autoregressive—they…

Jun 10, 2026 · Ryan Whitwam

Lemonade 10.1 Released For Latest Improvements For Local LLMs On AMD GPUs & NPUs

…Lemonade 10.1 is now available for Windows and Linux users to assist in running local AI apps primarily with AMD Ryzen and AMD Ryzen AI hardware while also being able to…

Apr 7, 2026

NVIDIA Delivers Day-1 Support For DeepMind's DiffusionGemma Open Model Across RTX & DGX Platforms, 150 Tokens/s With DGX Spark

…with Google’s Gemma 4 architecture. Up to 4x faster performance: The boost means fast text generation, where single-user generation usually stalls — on local hardware. Open and local: DiffusionGemma is open…

Jun 10, 2026 · Hassan Mujtaba

I ran local AI models on a six-year-old laptop with no GPU, and they actually worked

…Sign in to your XDA account Summary Local AI runs on modest PCs - no RTX needed; efficient small models work on CPU and iGPU. Sub-1B models feel instant for simple tasks…

Jun 5, 2026 · Samarveer Singh

A Modder Repurposed a Used V100 For LLM Acceleration

…But apparently someone has made an SXM2-to-PCIe adapter, and with that and a cooling mod, the V100 was more than capable of running LLMs locally. The adapter cost another $100…

May 11, 2026 · Jon Martindale

I used my local LLM to sort hundreds of gaming clips, and it was the laziest solution that worked

…I wrote a Python script that uses Gemma 4 31b, running locally on my PC, to identify the game in each clip and sort the files automatically. It actually works, and it…

Apr 15, 2026 · Adam Conway

Discussions and forums

Hacker News · u/joas_coder · 2w ago

Show HN: I made a Gemma 4 Mac app that names screenshots with local AI

I made my first macOS utility app that ships with a bundled Gemma 4 model, specifically the Gemma E4B one. It made my app DMG have 5.3 GB in size, but I think it is a small size for the power that this free local model c…

7 6

r/LocalLLaMA · u/gladkos · May 1, 2026

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…

Hacker News · u/lostathome · 1w ago

Show HN: Hitoku Draft – Context aware local assistant

Hi guys.I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it already, and now it has also transcription with voice editing. Looking for feedback, as …

15 1

r/LocalLLaMA · u/TumbleweedNew6515 · 2w ago

Followed topics

Search

Google Gemma

AMD's GAIA Defaults To Better Model, Continued Improvements For Local AI

Google's latest DiffusionGemma open AI model comes with a 4x speed boost

Lemonade 10.1 Released For Latest Improvements For Local LLMs On AMD GPUs & NPUs

NVIDIA Delivers Day-1 Support For DeepMind's DiffusionGemma Open Model Across RTX & DGX Platforms, 150 Tokens/s With DGX Spark

Top stories

I ran a powerful local AI model on my laptop, and it didn’t feel like a compromise

Run Google's Gemini LLMs right on your Mac with the new AI Edge Gallery

Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

I ran Gemma 4 and Qwen 3.5 for the same local tasks, and one pulled miles ahead

I ran local AI models on a six-year-old laptop with no GPU, and they actually worked

A Modder Repurposed a Used V100 For LLM Acceleration

I used my local LLM to sort hundreds of gaming clips, and it was the laziest solution that worked

Discussions and forums

Show HN: I made a Gemma 4 Mac app that names screenshots with local AI

Qwen 3.6 27B vs Gemma 4 31B - making Packman game!

Show HN: Hitoku Draft – Context aware local assistant

Update on 12x32gb sxm v100 cluster / local AI for legal drafting

Ask HN: Any Local LLM can I run without GPU for Local Agentic workflow AI?

I let two local LLMs fight over how to optimize a Linux VM, and they destroyed it instead

I tested 3 tiny local LLMs for everyday work, and only one of them impressed me

Google battles Chinese open weights models with Gemma 4