OpenAI announces GPT‑5.6 Sol, its next-generation flagship model beating Claude Mythos 5
…It delivers better results in agentic performance across coding, biology, and cybersecurity. On the Terminal-Bench 2.1 benchmark, which tests command-line workflows requiring planning, iteration, and tool coordination, GPT-5…