Google's Gemma 4 isn't the smartest local LLM I've run, but it's the one I reach for most
…that the LLM can execute actual commands on your device. Speculative decoding gives the 31B a speed boost Though Google left performance on the table Google trained the Gemma 4 models with…