Paper page - ATLAS: Agentic or Latent Visual Reasoning? One Word is Enough for Both
…Ziyu Guo , , , Abstract ATLAS presents a visual reasoning framework that combines agentic operations and latent representations using functional tokens, enabling efficient training and improved performance on complex benchmarks. AI-generated summary Visual…