TempFlow-GRPO: When Timing Matters for GRPO in Flow Models Paper • 2508.04324 • Published Aug 6, 2025 • 11
Flow-GRPO: Training Flow Matching Models via Online RL Paper • 2505.05470 • Published May 8, 2025 • 86
FlowRL: Matching Reward Distributions for LLM Reasoning Paper • 2509.15207 • Published Sep 18, 2025 • 114