Mixture-of-experts models quietly changed what hardware you need for local AI
…the reasoning quality that made you pick a particularly large model. Offloading a few layers to the system RAM also helps, but performance tanks, as a result. So, unless your GPU could…