Claude, ChatGPT, and Gemini get all the hype, but the most interesting AI models are coming from elsewhere
…It's a 230B mixture-of-experts with only 10B active parameters per token, eight of 256 experts routed per token, a 205K context window, and a near-GPT-5.3-Codex…
