I replaced GitHub Copilot with a self-hosted AI and I won’t go back
…run local LLMs to handle the tasks I was using Copilot for, and it's never been easier. There's also never been more choice for local use, whether that's Gemma…
Tracked topic
Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.
…I got into local LLMs because it seemed interesting, if I’m being completely honest. Running my own AI on my own hardware just felt like…
…While Falcon, Gemma (Google), and Mistral are all legitimate open-weight models you can run locally, Meta's Llama series is arguably the most widely adopted and has the largest ecosystem of…
…Boichat discovered those latest malware bits using a local Gemma E2B AI model. The new malware attempt in AUR was described as "a bit more elaborate" in obfuscating the action around the…
…version seems to be built on Gemma 3. There's not much difference between the models for what OMNI can handle, and they run locally on your device, so you get privacy…
…Configure local API endpoint Inside the plugin settings, I entered the local Ollama host URL. Since Ollama runs locally, the plugin can directly communicate with the model through the API. This helps…
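As a rough illustration of what "communicating with the model through the API" can look like, here is a minimal Python sketch against Ollama's HTTP API. It is not the plugin's actual code. The host URL is Ollama's default (`http://localhost:11434`), and the model tag `gemma3` and the helper names `build_request` and `generate` are assumptions chosen for the example; substitute whatever model you have pulled locally.

```python
import json
import urllib.request

# Assumption: Ollama is listening on its default local address.
OLLAMA_HOST = "http://localhost:11434"


def build_request(prompt, model="gemma3", host=OLLAMA_HOST):
    """Build a POST request for Ollama's /api/generate endpoint.

    `model` is a hypothetical choice; use any tag you have pulled.
    """
    body = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    return urllib.request.Request(
        f"{host.rstrip('/')}/api/generate",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt, model="gemma3", host=OLLAMA_HOST, timeout=120):
    """Send the prompt to the local server and return the generated text."""
    request = build_request(prompt, model, host)
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        # With "stream": False the server replies with a single JSON object
        # whose "response" field holds the full completion.
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    print(generate("Summarize this note in one sentence: ..."))
```

Because the request never leaves `localhost`, no API key is involved; the only setting a client needs is the host URL.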
…nvcr.io/nim/google/diffusiongemma-26b-a4b-it:latest” $ docker run --gpus=all \ -e NGC_API_KEY=$NGC_API_KEY \ -v "$LOCAL_NIM_CACHE:/opt/nim/.cache" \ -p 8000:8000 \ ${NIM_IMAGE…
I made my first macOS utility app that ships with a bundled Gemma 4 model, specifically the Gemma E4B one. That brings the app's DMG to 5.3 GB, but I think that is small for the power that this free local model c…
Gemma just crushed Qwen in a local LLM gamedev contest! Device: MacBook Pro M5 Max, 64GB RAM Qwen 3.6 27B: 32 tokens/sec · 18m 04s · 33,946 tokens. Gemma 4 31B: 27 tokens/sec · 3m 51s · 6,209 tokens. So what is more impo…
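The figures in that post can be cross-checked with a little arithmetic. The sketch below uses only the numbers quoted above and assumes the listed times are total generation time; the helper names `seconds` and `implied_rate` are made up for the example.

```python
def seconds(duration: str) -> int:
    """Convert a string like '18m 04s' to whole seconds."""
    minutes, secs = duration.rstrip("s").split("m")
    return int(minutes) * 60 + int(secs)


def implied_rate(tokens: int, duration: str) -> float:
    """Tokens per second implied by a token count and a wall-clock time."""
    return tokens / seconds(duration)


# Token counts and times exactly as quoted in the post.
runs = {
    "Qwen 3.6 27B": (33_946, "18m 04s"),
    "Gemma 4 31B": (6_209, "3m 51s"),
}

for name, (tokens, duration) in runs.items():
    rate = implied_rate(tokens, duration)
    print(f"{name}: {seconds(duration)} s, {rate:.1f} tokens/sec")
```

The implied rates come out at roughly 31.3 and 26.9 tokens/sec, in line with the quoted 32 and 27. On these numbers the shorter run is explained by Gemma emitting far fewer tokens, not by faster generation.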
Hi guys. I have been working on Hitoku Draft, an open-source, voice-first AI assistant that runs entirely locally. I posted about it already, and now it also has transcription with voice editing. Looking for feedback, as …
Update from the lawyer with the V100 server. A few of you asked what I actually ended up running once the dust settled, so here it is. Still just a lawyer, still driving the whole thing through Claude Code, still not ful…
Claude Code-style agentic workflow AI is too costly for me. Is there any LLM I can run with VS Code on the setup below? 16 GB RAM, 13th-gen Intel Core i7 H-series processor, 512 GB NVMe SSD. I want to run the AI as a local agentic workflow with VS Code. I w…
…up vLLM as the backend, with Gemma 4's mixture-of-experts model to begin with, and Open WebUI as the front end. It's running on Ubuntu via WSL2, partly because…
…Performance-wise, this setup is able to generate a 15ish minute podcast with three speakers in roughly 20 minutes, which is pretty impressive considering that everything runs on a local AI pipeline…
…gap was smaller than I expected Where the local LLM fits in LM Studio runs the server, Fabric sends the prompts The local LLM part of the setup actually happens inside that…