Paper page - MineExplorer: Evaluating Open-World Exploration of MLLM Agents in Minecraft
…Then we organize the benchmark around a ReAct-style capability formulation and compose atomic tasks into implicit multi-hop tasks . To further construct reliable instances, MineExplorer uses a multi-agent synthesis workflow…