I finally found a local LLM I want to use every day (and it's not for coding)
…Related I replaced my local LLM with a model half its size and got better results — And it wasn't about the parameters I switched from a 20B model to a 9B…
…Related I replaced my local LLM with a model half its size and got better results — And it wasn't about the parameters I switched from a 20B model to a 9B…
…AI-generated summary Vision-Language Models (VLMs) have advanced rapidly in multimodal perception and language understanding , yet it remains unclear whether they can reliably ground language into spatially coherent, plausibly executable actions…
…Then the hammer dropped. AI labs have moved to a "compute-based" usage model, which factors in complexity, tool calling, and how long your chats are into the usage cap. It's…
…Anthropic, and others D Only locally hosted open-source models Correct! Fabric supports multiple AI backends, including OpenAI, Anthropic (Claude), Google Gemini, Ollama for local models, and more. This flexibility means users…
Linux May Drop Old Network Drivers Now That AI-Driven Bug Reports Are Causing A Burden Written by Michael Larabel in AI on 21 April 2026 at 03:45 PM EDT. 65…
…Who they’re for Developers and AI enthusiasts Veteran creators and engineers Customers exploring local AI at scale Example workloads Running and fine-tuning AI models locally AI-assisted video, 3D, and…
…personal AI workflows locally on a Ryzen AI PC How to connect local agents to frontier models deployed on AMD Instinct GPUs How to route tasks between local and cloud models based…
Hi HN, I'm Antoine Zambelli, AI Director at Texas Instruments.I built Forge, an open-source reliability layer for self-hosted LLM tool-calling.What it does:- Adds domain-and-tool-agnostic guardrails (retry nudges, step e…
Recently I was using functiongemma and watched it load and run local source code as a tool call without any training/tuning. A couple days later I got Qwen35 in Open-WebUI to use the "native" tool-calling. With Open-WebU…
!UPDATE!(20.05.2026) WE HAVE NEW NUMBERS FROM 1.500+ TESTS IT'S WORKING! check my update post https://www.reddit.com/r/LocalLLaMA/s/AyNOehjkYT Or the go straight to the my Github https://github.com/OttoRenner/Gentle-Codi…
There is a lot of disdain for DGX Sparks here on the sub. And I get it. A lot of people say “It could have been great if it had been better memory bandwidth”, “SM-121 is a fake /second-class Blackwell chip” yadda, yadda.…
Hi HN! Pierce here.Rotunda is a firefox fork primarily intended for agent use, which I’ve been hacking on nights/weekends.There was a [lengthy](https://news.ycombinator.com/item?id=48024859) discussion last week on how e…
…The local model is not used by that AI, instead it powers features like “Help me write”. Hanff says that the silent installation of the model could potentially be illegal in several…
…The local memory capacity that variable graphics memory provides makes that possible, even for the largest and most complex models. With AMD Ryzen™ AI Max+ PRO processors, engineering teams can process large…
Microsoft CEO says its AI data centers consume less water than your local diner