Ollama is still the easiest way to start local LLMs, but it's the worst way to keep running them
…The nullmirror team documented their switch from Ollama to llama.cpp and found consistent throughput improvements across every model they tested, with no quality tradeoff. Their conclusion was put pretty bluntly, as…
