Paper page - Injecting Distributional Awareness into MLLMs via Reinforcement Learning for Deep Imbalanced Regression
…Yao Du , , Abstract A distribution-aware reinforcement learning framework improves multimodal large language models' numerical regression performance on long-tailed distributions through batch-level comparison-based supervision. AI-generated summary Multimodal large…