I turned my phone into a local LLM server, and it handles vision, voice, and tool calls
…Google's newest open-weights model family has two mobile-tier variants, E2B and E4B, designed specifically for on-device inference. They've got multimodal input (text, image, and audio), a 128K…
