Paper page - SVI-Bench: A Dynamic Microworld for Strategic Video Intelligence
…Across nine tasks built from aligned video, play-by-play logs, commentary, reports, and statistics, our evaluation reveals a sharp capability cliff: current models handle localized perception reasonably well but struggle significantly…