Open R1: Update #2
Thanks for sharing your results and describing the background of what happened around GRPO research last week. Do you plan to test classical distillation, not just fine-tuning on reasoning traces? · Yes…
To show you the most relevant results, we’ve omitted some entries very similar to those already shown. Repeat the search with the omitted results included.