Running Ollama on a 15W CPU sounded ridiculous until I got it working with decent results
… For Qwen3 on my compact system, the 4B model managed around 4 tok/s, both with a simple question and when asked what XDA Developers is. Not brilliant, but more than sufficient for firing off queries while doing something else. The Intel Core i5-10210U was never designed with local LLMs in mind. …
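For anyone who wants to check a tok/s figure like this on their own hardware, here is a minimal sketch of one way to do it. It assumes Ollama is serving on its default local port and that the model is pulled under the tag `qwen3:4b`; the tag, the prompt, and the helper names `tokens_per_second` and `measure` are illustrative assumptions, not the setup used for the numbers above. It relies on the `eval_count` and `eval_duration` (nanoseconds) fields that Ollama returns from `/api/generate`.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    """Convert generated-token count and generation time (ns) into tok/s."""
    if eval_duration_ns <= 0:
        return 0.0
    return eval_count / (eval_duration_ns / 1e9)


def measure(model: str, prompt: str) -> float:
    """Send one non-streaming prompt and return the generation speed in tok/s."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        stats = json.load(response)
    return tokens_per_second(stats["eval_count"], stats["eval_duration"])


if __name__ == "__main__":
    # Hypothetical model tag and prompt; swap in whatever you have pulled.
    speed = measure("qwen3:4b", "What is XDA Developers?")
    print(f"{speed:.1f} tok/s")
```

Running `ollama run` with the `--verbose` flag should report a similar eval rate without any scripting, so the script is mainly useful for repeating the measurement across several prompts.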