I ran local LLMs on Intel's cheapest iGPU, and the results were surprisingly decent
…Although I’d argue that with MoE offloading, Mixture of Experts models can run even on ancient systems, you’ll still need a discrete graphics card to run these bulky LLMs. But…
