Paper page - Reinforcing Multimodal Reasoning Against Visual Degradation
…AI-generated summary: Reinforcement Learning has significantly advanced the reasoning capabilities of Multimodal Large Language Models (MLLMs), yet the resulting policies remain brittle against real-world visual degradations such as blur…