Residual Stream Duality in Modern Transformer Architectures Paper • 2603.16039 • Published 12 days ago • 4
FlashSampling: Fast and Memory-Efficient Exact Sampling Paper • 2603.15854 • Published 12 days ago • 8
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 624 items • Updated 5 days ago • 93
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 4 days ago • 245
Language Server CLI Empowers Language Agents with Process Rewards Paper • 2510.22907 • Published Oct 27, 2025 • 5
view article Article Lanser-CLI: Language Server CLI Empowers Language Agents with Process Rewards 🛠️🏆 Oct 27, 2025 • 1
On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning Paper • 2505.17508 • Published May 23, 2025 • 8
FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Paper • 2505.02735 • Published May 5, 2025 • 33