Paper page - InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?
…A Framework for Benchmarking and Improving Coding Agents for Robot Manipulation (2026) Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification (2026) Coding with Eyes: Visual Feedback Unlocks Reliable GUI…