PonderLM-2: Pretraining LLM with Latent Thoughts in Continuous Space Paper • 2509.23184 • Published Sep 27, 2025 • 1
FlowRL: Matching Reward Distributions for LLM Reasoning Paper • 2509.15207 • Published Sep 18, 2025 • 114
Pretraining Language Models to Ponder in Continuous Space Paper • 2505.20674 • Published May 27, 2025 • 2