Google's $100 AI Ultra is still overpriced, and here's what I use instead
…That pricing shift is unsustainable, and I've started running a local LLM as a hybrid model to offset the costs. It's not as fast, or as capable, but it's…
Tracked topic
Gemma is a family of open-weight language models released by Google for text generation and related NLP tasks.
…That pricing shift is unsustainable, and I've started running a local LLM as a hybrid model to offset the costs. It's not as fast, or as capable, but it's…
…Related I ran local LLMs on Intel's cheapest iGPU, and the results were surprisingly decent It ain't no match for a dedicated GPU, but you can run some light LLMs…
…Related Google's Gemma 4 isn't the smartest local LLM I've run, but it's the one I reach for most Google's newest Gemma 4 models are both powerful…
…Related 7 self-hosted services I use that can run perfectly on a Raspberry Pi Not every self-hosted application requires a top-of-the-line workstation Open WebUI + local LLMs provide…
…And the option that really pulled me in was BYOK with any OpenAI-compatible endpoint, which means you can also point it at a local model running through something like LM Studio…
…While Falcon, Gemma (Google), and Mistral are all legitimate open-weight models you can run locally, Meta's Llama series is arguably the most widely adopted and has the largest ecosystem of…
…The first is by connecting a browser extension to a local server running on your machine. The second is to use extensions that run models directly in the browser. I went with…
…In my own workflow, I run a locally hosted 24B variant of Gemma 4 for routine generative tasks, such as first drafts and boilerplate, and reserve Claude strictly for the features that…
…While Falcon, Gemma (Google), and Mistral are all legitimate open-weight models you can run locally, Meta's Llama series is arguably the most widely adopted and has the largest ecosystem of…
…There are other options, like LM Studio, that also power local AI setups. Open WebUI Bring the ChatGPT experience to your own local hardware While Ollama runs the models, Open WebUI is…