5 18 1

Yang Yue

Yang130

https://yueyang130.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper about 10 hours ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

upvoted a paper 3 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

upvoted a paper 4 months ago

GenExam: A Multidisciplinary Text-to-Image Exam

View all activity

Organizations

upvoted a paper about 10 hours ago

The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning

Paper • 2601.06002 • Published 3 days ago • 34

upvoted a paper 3 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 141

upvoted a paper 4 months ago

GenExam: A Multidisciplinary Text-to-Image Exam

Paper • 2509.14232 • Published Sep 17, 2025 • 21

upvoted a paper 6 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20, 2025 • 85

upvoted an article 6 months ago

Article

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Jul 10, 2025

•

upvoted a paper 7 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2, 2025 • 148

upvoted an article 7 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

Jun 3, 2025

•

307

upvoted 3 papers 7 months ago

Taming LLMs by Scaling Learning Rates with Gradient Grouping

Paper • 2506.01049 • Published Jun 1, 2025 • 38

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

Large Language Models for Data Synthesis

Paper • 2505.14752 • Published May 20, 2025 • 49

upvoted a paper 8 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22, 2025 • 35

authored 3 papers 8 months ago

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Paper • 2411.02359 • Published Nov 4, 2024 • 13

Improving and Benchmarking Offline Reinforcement Learning Algorithms

Paper • 2306.00972 • Published Jun 1, 2023

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 188

upvoted a paper 8 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 188

commented 3 papers 9 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139 •

upvoted a paper 9 months ago

CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning

Paper • 2504.13820 • Published Apr 18, 2025 • 16

authored a paper 9 months ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published Apr 18, 2025 • 139

Yang Yue

AI & ML interests

Recent Activity

Organizations

Yang130's activity

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data