Paper page - AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward
…https://huangrh99.github.io/AlphaGRPO/ View arXiv page View PDF Project page GitHub 50 Add to collection Community AlphaGRPO enables multimodal generation RL training across text and image generation for AR-Diffusion…