Google's Gemma 4 isn't the smartest local LLM I've run, but it's the one I reach for most
…It's hard not to read that as a deliberate decision to keep the best inference performance locked to their own framework. Community workarounds are already appearing, though. A team…