Paper page - MARBLE: Multi-Aspect Reward Balance for Diffusion RL
…AI-generated summary Reinforcement learning fine-tuning has become the dominant approach for aligning diffusion models with human preferences. However, assessing images is intrinsically a multi-dimensional task , and multiple evaluation criteria…